Gene Franean1_2114 details

Gene Information

Locus tag: Franean1_2114
Symbol:
ID: 5670514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2540300
End bp: 2541754
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 72%
IMG OID: 641241035
Product: CBS domain-containing protein
Protein accession: YP_001506456
Protein GI: 158313948
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
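
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon (the gene sequence in this record ends in TGA). The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2540300
END_BP = 2541754


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1455, matching "Gene Length"
print(protein_length_aa(length))  # 484, matching "Protein Length"
```

Both results agree with the Gene Length (1455 bp) and Protein Length (484 aa) fields listed above.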


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGACTCCG GTGATCTTCT CCTCGTCTTC ATCGCCGCGG TGACCGCGCT CGCCGCGGCC 
GGCCTGGGCG CGGTGGACGC GGCGCTGACC AGGGTCTCCC GGGTCAGCGT CGACGAGTTC
GCCCGGCAGG GCAAGACGGG GGCGGCGAAC CTCGCCCGGG TCGTCGCCGA CCCCGGGCGC
TACCTGGCGT TGTTGTTGCT GCTGCGGATC GCCGGCGAGA TGGCCGCGGC GGCCTGCATC
ACCGTCCTGG CCGTGCACGC CTACGGCTCC GGGTTCGCCG CCGTCGGGCT GGGGGCCCTG
GTGTCGACGC TCGTCGCCTA CATCCTCGTC GGCGTGATGT TCCGGACGCT CGGCCGCCAG
CACGCCCCGG CGGTGTCGCT GGCCACCGCC GGGCTGACCA TCCGGCTCGC GAAGGTCTTC
GGCCCGCTGC CCCGCCTGCT GATCGCCTTC GGCAACGCGG TGACGCCCGG CCCCGGCTAC
CGGGACGGCC CGTTCGCCTC GGAGGCGGAG CTGCGCGACC TCGTCGACCT CGCCGAGGAG
AACGAGGTCA TCGAGCCCGA GGAGCGCGAC ATGATCGCCT CGGTGTTCGA GCTGGGTGAC
ACGCTCGTCC GCGAGGTGAT GGTGCCGCGG CCCGACATGG TCTTCATCGA GTCCGACAAG
ACCGTCCGGC AGGCGATCTC GCTGGCGCTG CGCAGCGGCT TCTCCCGCAT CCCGGTGATC
GGCGAGAGCA TCGACGACGT CGTCGGCATC GGCTTTCTCA AGGACATGGT CGGCTGGGAG
CGGGAGGGCC GGGAGAGCAG CCGGGTCGCG GAGGTCATGC GCCCACCCGT GCTCGTCCCG
GAGAGCAAGC CCGCCGACGA CCTGCTGCGC GAGATGCAGG CGTCGCGCAC CCACATGGCC
ATCGTCATCG ACGAGTACGG CGGCACGGCC GGCCTGGTGA CCATCGAGGA CGTCCTCGAG
GAGATCGTCG GTGAGATCAC GGACGAGTAC GACAGCGCCA CGCCGCCGGT CGAGTGGCTC
GACGACGACA CCGCCCGGGT GACGGCGCGG CTCGACGTCG ACGACCTCGC GGACCTGTTC
GGCGTCGAGG AGCTGCCCGG AGCGCAGGAC GTCGAGACTG TCGGCGGGCT GCTCGCGAAC
GCGCTGGGCC GGGTGCCCAT CCCCGGCGCG ACGGCGGACG TCGCCGGCCT GCGGCTGTCG
GCAGAGCGCG CGGCCGGGCG GCGCAACCAG ATCGGCACCG TCGTCGTCCA CCGGCTGTCC
CCGGCACCGG GCAACGGCGG GAACGGAGCC GGCAGGAGCG GAGCCGGCAA GGGCGCCACC
GGCAAGGGCG CCACCGGGAG TGACAGCAAG GGCAGCAACG GCAAGGGCAC CAACAGGAAA
ACCGACGGCA AGAAGACCGA CGGCGAGTCG GAGGGCCACC CGCCGGGCCC AGCGAGCAGA
AAGGTGACAT CGTGA
 
Protein sequence
MDSGDLLLVF IAAVTALAAA GLGAVDAALT RVSRVSVDEF ARQGKTGAAN LARVVADPGR 
YLALLLLLRI AGEMAAAACI TVLAVHAYGS GFAAVGLGAL VSTLVAYILV GVMFRTLGRQ
HAPAVSLATA GLTIRLAKVF GPLPRLLIAF GNAVTPGPGY RDGPFASEAE LRDLVDLAEE
NEVIEPEERD MIASVFELGD TLVREVMVPR PDMVFIESDK TVRQAISLAL RSGFSRIPVI
GESIDDVVGI GFLKDMVGWE REGRESSRVA EVMRPPVLVP ESKPADDLLR EMQASRTHMA
IVIDEYGGTA GLVTIEDVLE EIVGEITDEY DSATPPVEWL DDDTARVTAR LDVDDLADLF
GVEELPGAQD VETVGGLLAN ALGRVPIPGA TADVAGLRLS AERAAGRRNQ IGTVVVHRLS
PAPGNGGNGA GRSGAGKGAT GKGATGSDSK GSNGKGTNRK TDGKKTDGES EGHPPGPASR
KVTS