Gene Franean1_1954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1954 
Symbol 
ID5670355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2350034 
End bp2351758 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content73% 
IMG OID641240875 
Producthypothetical protein 
Protein accessionYP_001506297 
Protein GI158313789 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0113455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.562335 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGAGG CGCTCCTCCT GCTCCTCAGC TTTGCGCTAG TCATCGTATG TGGCATCTTC 
GTCGCGGCCG AGTTCGCCTT CGTGACGGTG GACCGGCCCA CGGTCGAGCG CTCCGCGGCG
GACGGCGACA GCCGCTCGGC GGGCGTCCTC ACCGCACTGC GAAGCCTCTC CACCCAGCTC
TCCGGCGCAC AGCTGGGCAT CACCATCACC AACCTGGCGA TCGGCTTCCT CGCCGAGCCC
GCCATCGCGG ACCTACTTGA AGGTCCCGTC ACGACTCTGG GTGCCTCCGA GGGCCTGGCA
CGTGGTCTCT CCGTGGCCCT GGCACTCGTG CTCGCGACCG CGTTCACGAT GTTGTACGGC
GAGCTCGTCC CCAAGAACCT GGCCATCGCC AGGCCGCTGG GGACGGCGCG CGCCGTCCAG
CGTCCGCAAC GGCTGTTCAC CCGGGTGACC AGCCCGGTGA TCCACTCGCT GAACAGCACC
GCGAACGCGC TGCTGCGGCG GGTCAACGTC GAACCCCAGG AGGAACTGGC GTCGGCACGT
TCCCCGCAGG AGCTGTTCTC GCTGCTCGGC CGGTCGGCCG AGCACGGAAC GCTGCCGCGG
GAGACGGCGA CGCTCATGCA GCGCTCGCTG ACCTTCGGCG ACCGGGTCGC CGAGGACGTG
ATGACCCCCC GGATGCGCAT GCAGTCGATC GACGCGGACG CCCCCGTCGC CGAGGTGATC
AGCGCCGTCC GGCGGACCGG GCACGCCCGG TTCCCCGTCA TCGGCGACGG CAGCGACGAC
GTAGTCGGCC TGATCCATGT CAAGCACGCG GTGAGCGTGC CGGAGGAGCG GCGCGACACG
GTACAGGTGC GCACCGTGAT GATCCCCGCG GCGACGGTGC CGTCCTCGAT GCCCCTCGAG
CCGCTGTTGG AGACGCTGCG CTCGGGCGGC CTGCAGATGG CGATCGTCGT CGACGAGTTC
GGCGGGGTCG ACGGCCTGGT GACGGCGGAG GACCTCATCG AGGAGATCGT CGGTGACGTC
GTCGACGAGC ACGACCGGGT CAGCCCCCGG GCGCTGCGCC GCCGGGACGG CAGCTGGCTC
GTCTCGGGCC TGCTGCGGCC GGAGGAGGCC AGTGAGGTCA CCGGCCTGCC GATCCCGGCC
GACGACGCCT ACCAGACGCT GGGCGGCCTG ATGTCGCGCA CCCTCGGGCG TATCCCCGGC
ACCGGCGACA CGATCGTCCT GGACGGCATC CGCTACGAGG TCGAGCGGAT GGACGGCCGC
CGCGTCGACC GCATCCGGCT CGACCCGCGG GGCGACGCCG CGACGAACCC GGCGCGGGCC
GACATCGCGG AACAGCCCTC GGGCGAGGCG CCAGCCACGA CTCCGCCACC CGCGACCCCG
GCGGACACGA AGGCCACCGA CGAGAATCCA CCGACCGTAG CGCCAGCGAC CACAGCGCCG
GCAGGCACGG CGTCGGCGAG CACAACGGCG GCGGACGCGG CACCGACGGG CGCGGCGCAG
GCAGACACGG CCCCGCCGAC GGGCACAGCG CCAGCGGTGG GCACAGCGCC AGCGGCGGGC
ACAGCGCCAG CGGACACAGC ACCGCCGGAC ACAGCACCGC CGGCGGGCAC GAAGCCGCCG
GGCACCGGGC CTGGACGGGC CCGCGGCCGG GACCGCGCTG ACGACGCGGG TCGGCGCGGG
CGCACGCGGT CGGTCTCAGC CGGGGTCAGG GAGTCCGAGC GATGA
 
Protein sequence
MTEALLLLLS FALVIVCGIF VAAEFAFVTV DRPTVERSAA DGDSRSAGVL TALRSLSTQL 
SGAQLGITIT NLAIGFLAEP AIADLLEGPV TTLGASEGLA RGLSVALALV LATAFTMLYG
ELVPKNLAIA RPLGTARAVQ RPQRLFTRVT SPVIHSLNST ANALLRRVNV EPQEELASAR
SPQELFSLLG RSAEHGTLPR ETATLMQRSL TFGDRVAEDV MTPRMRMQSI DADAPVAEVI
SAVRRTGHAR FPVIGDGSDD VVGLIHVKHA VSVPEERRDT VQVRTVMIPA ATVPSSMPLE
PLLETLRSGG LQMAIVVDEF GGVDGLVTAE DLIEEIVGDV VDEHDRVSPR ALRRRDGSWL
VSGLLRPEEA SEVTGLPIPA DDAYQTLGGL MSRTLGRIPG TGDTIVLDGI RYEVERMDGR
RVDRIRLDPR GDAATNPARA DIAEQPSGEA PATTPPPATP ADTKATDENP PTVAPATTAP
AGTASASTTA ADAAPTGAAQ ADTAPPTGTA PAVGTAPAAG TAPADTAPPD TAPPAGTKPP
GTGPGRARGR DRADDAGRRG RTRSVSAGVR ESER