Gene Franean1_6589 details

Gene Information

Locus tag: Franean1_6589
Symbol:
ID: 5674904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8018594
End bp: 8019595
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 78%
IMG OID: 641245440
Product: ApbE family lipoprotein
Protein accession: YP_001510832
Protein GI: 158318324
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1477] Membrane-associated lipoprotein involved in thiamine biosynthesis
TIGRFAM ID:
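
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(8018594, 8019595)
    print(length)                           # 1002
    print(expected_protein_length(length))  # 333
```

Under these assumptions the computed values agree with the Gene Length (1002 bp) and Protein Length (333 aa) fields in the record.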


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.162386
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCTAGGG CCGGCGCCGG CCAGGACGCG CGCCCGTCGC GCCCGTTGCA CCCGTCGCGC 
CCGTCGCGAC CGGGGCGCAG CCACGCCGAA CCGGTGATGG GCACGGTTGT CAGCATCGAC
GTCCGCTCGC CGCTCGACGC GGTCGGCCTG GACGCGGCGA TCGGTGCCGC GGTCCGTGTG
CTGCATCGGG TCGACGAGGA TTTCAGCACG TTCCGGGCGA CGTCCTGGGT GTCGCGGCTG
CGCCGCGGCG AGATCGAGCT CGGCGACTGT CCGGATCACG TCCGCGAGGT GTACCGGGCG
GCCGCCGAAT GCCGGGAGCA GACCGGGGGA TGGTTCGACC CCGCCTGGCG CGGGGACGGC
ACGCTGGACC CGACCGGGCT GGTCAAGGGC TGGGCGGCCG ACGCCGCGTC GACGGCCCTG
ACCGCCGCGG GTGCCCCGAG CCACTGCGTC AGCGCCGCCG GTGACCTGCG GGTGCGGGGC
ACCTCGGGGT GGGCGCCGGG ACGTCCATGG CGGATCGGGA TCGCCGATCC GTTCGACCGG
GCCAGGCTGG TCGCCGTGGT GGAGGGCACC GAGCTGGCCG TCGCGACCTC GGGGGTCGCC
GAGCGGGGCG CGCACGTCGT CGACCCGCGT ACCGGCGCCC CGGCGACGGG CCTCGCCTCG
GTCACCCTCG TGGGCGCCGA CCTCGTGGGT GCCGACGCCA CGCTCGCGGA CGGGTTCGCC
ACCGCCGCCC TCGCCGCCGG GCCGGAGGCG CCGGCCCTGC TCACCCACCT CGCCCGCCGG
GGATGGGAGT GGCTGACGGT CGACACCACC GGGCGGCTCA CGCACTCCGC CGGCTTCCCG
GGCCAGGCCA CCACGACCGC CGCGGGGCCG GTCGCCGAGA CAGCCCCGCG GCCGGCCGTC
CGAGCGACAG CCCCGCGGCC GGCCGTCCGA GCGACAGCCC CGAGGCCAGC CGTGCGAGCG
ACCGCCCCGA GGCCGGCCGC CGGACGTCCC GACGGGAGAT GA
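
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be stripped before any analysis. As a minimal sketch, assuming the GC content field is the fraction of G and C bases over the full gene sequence, the figure could be recomputed as follows. The helper names are hypothetical, and how the source database rounds its value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # Only the first two blocks of the gene sequence are shown here;
    # paste the full sequence to recompute the value for the whole gene.
    first_blocks = "GTGTCTAGGG CCGGCGCCGG"
    seq = clean_sequence(first_blocks)
    print(len(seq), gc_fraction(seq))
```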
 
Protein sequence
MSRAGAGQDA RPSRPLHPSR PSRPGRSHAE PVMGTVVSID VRSPLDAVGL DAAIGAAVRV 
LHRVDEDFST FRATSWVSRL RRGEIELGDC PDHVREVYRA AAECREQTGG WFDPAWRGDG
TLDPTGLVKG WAADAASTAL TAAGAPSHCV SAAGDLRVRG TSGWAPGRPW RIGIADPFDR
ARLVAVVEGT ELAVATSGVA ERGAHVVDPR TGAPATGLAS VTLVGADLVG ADATLADGFA
TAALAAGPEA PALLTHLARR GWEWLTVDTT GRLTHSAGFP GQATTTAAGP VAETAPRPAV
RATAPRPAVR ATAPRPAVRA TAPRPAAGRP DGR