Gene Franean1_2814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2814 
Symbol 
ID5671203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3329479 
End bp3331239 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content72% 
IMG OID641241723 
Producthypothetical protein 
Protein accessionYP_001507143 
Protein GI158314635 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGTTCG TCGAGGAGTC CGGCCGGCGG GAGCGGACGT TCGATTTCGC CGTTCTGCCG 
GTCGAGCGGG ACGTCCAGCG ATGGCTGGCC CGGGTGTTCG CCCGCCGAGC CGGTCCACGT
TCGGCGACGA AACGGATGGG CACCGCCGTG GGGCACTTCG ATGTCCTTCG GGGCTTCGCC
GCGTCGCTCG CCGGGGCGTC ACCGTCCCCG CGCTGCCCGG TCGATCTGCG TCCCGCGCAC
GTCACCGCGT TTCTCCTGCG CTACCGCGGT CAGCCCAGCG AGCGGGAGTA TCTCAAGCGG
CTGCGAGTGC TGCTGCGTGA CGATCCGGAG CTGCCCGAGC CGACGCGAAC GGCGCTGTTG
TCCGCACGGC TCGCGCCGGC GGCGCCGGCG AACCCGCTTG TCGGCTACAG CGACACGGAA
TGGCAGCAGA TCATGACCGC GGTACGGCGG GACGTGCGGC TGGCCCGAGA CCGTATCCGG
GCGAGCCGCC AGCTGCTCGA TCGTTTCCGC GTCGGCGCCG TGCCGTCCGA GGGCCCGGAG
AGCGGGCTCG CCGTGTTGCT GGACGTCTTC GACCGCACCG GTGATCTTCC TCGGATCGAC
TCGGGCGGGC ATTCCCGAGC CGTTCGGGAC GCCGGTGGCA TGACCGCCAT CGGTGGGCGG
CTGTGCCTGT CCAGCGACGA GGCGGTCGCG TTCTGCCTGC TGCTGGTCGC GTTGACCGGG
GAGAACTTCG GCACCGTCGC CGCGTGGCCG GCGGCACACC ACCTGCCCGA CGGTGGCCAC
GGCGACACCG GTATCGCCCT GGTCGAAGCG GTCAAACCGC GACGCGGACC CGACCGGGAG
CACATGGTCA TCGCGCTGGA GGACCTGCCC ACCGGACTGG AGGCATCCGG TGAGGAGACA
CGGCTGTTCC GTTCGCCGCT GCGGGTCTAC CGGCTGCTGG TGGAGCTGAC CGAGCTCTCC
CGCCGGCACG GCGTCCACAC GTCGGCGTTC AGCGCCTTCG TCGCCCGGCC CGGCCGGCTC
GGCTCCCGCT GGGCCGAGGG GGTCAACGCC ACGGACCTGC TCTGGTGGGC CCGACGCCGC
GACTTTCCCG CCGCAGCCGA CGCAGGTCCG GGCACGAAAC CGGCGGTGCA CGTCGGACGC
CTGCGCCAGA CCGTGATCGA ACGCCGTCGG CAGCCGGTCG CCCATACCCG GCAGACCATG
AACGACCACT ATCTGCGGCG CAGCCGCACG GTCCAGGACG ACAGCCGCAT GGTGGTCGGT
GCCGCGCTGC GCGAGCAGGT CGACAGCGCG CGGACAGCCC AGAGCATGCC CGTACTCACC
GTCGCCTTCC TTGCCCACGC CCGCCGCGAC CCCGCCGCCG CGGCGGCCAC GGCCGGGATG
GACCAAGACA CCCTGCGCCG CCTGATCTCC GGGGTGCAGG ACACCGCCCT TGCCTCCTGC
GCGGACCACC GCAACGGCCC GCACACCACG GCGGGACAGC CCTGCCTGGC GTCGTTTCTG
GACTGTCTGG ACTGCCCGAA CGCCCGCGCG CTGCCCCACC AGCTCGGCGT GCAGATGCTG
GCCGCCGAGC GGCTGCGCGC GCTGCGACCG AACATCACCC CGGCTGTCTG GGAGGCGCAC
TTGCGCCGGC GTCTCGACCA GCTGGAGGAG ATCCTGAACC ACTACACTGC GGCCGAACGC
GACCACGCCC GCGCCACCGT GACCGCCCGC CAGCAGCAGC TCGTAGACGA CCTGCTCGAC
GGCCGATGGG ACCTGCGATG A
 
Protein sequence
MRFVEESGRR ERTFDFAVLP VERDVQRWLA RVFARRAGPR SATKRMGTAV GHFDVLRGFA 
ASLAGASPSP RCPVDLRPAH VTAFLLRYRG QPSEREYLKR LRVLLRDDPE LPEPTRTALL
SARLAPAAPA NPLVGYSDTE WQQIMTAVRR DVRLARDRIR ASRQLLDRFR VGAVPSEGPE
SGLAVLLDVF DRTGDLPRID SGGHSRAVRD AGGMTAIGGR LCLSSDEAVA FCLLLVALTG
ENFGTVAAWP AAHHLPDGGH GDTGIALVEA VKPRRGPDRE HMVIALEDLP TGLEASGEET
RLFRSPLRVY RLLVELTELS RRHGVHTSAF SAFVARPGRL GSRWAEGVNA TDLLWWARRR
DFPAAADAGP GTKPAVHVGR LRQTVIERRR QPVAHTRQTM NDHYLRRSRT VQDDSRMVVG
AALREQVDSA RTAQSMPVLT VAFLAHARRD PAAAAATAGM DQDTLRRLIS GVQDTALASC
ADHRNGPHTT AGQPCLASFL DCLDCPNARA LPHQLGVQML AAERLRALRP NITPAVWEAH
LRRRLDQLEE ILNHYTAAER DHARATVTAR QQQLVDDLLD GRWDLR