Gene Franean1_5680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5680 
Symbol 
ID5674006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6895279 
End bp6896841 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content75% 
IMG OID641244533 
Productputative bifunctional allantoicase/OHCU decarboxylase 
Protein accessionYP_001509936 
Protein GI158317428 
COG category[F] Nucleotide transport and metabolism
[S] Function unknown 
COG ID[COG3195] Uncharacterized protein conserved in bacteria
[COG4266] Allantoicase 
TIGRFAM ID[TIGR02961] allantoicase
[TIGR03180] OHCU decarboxylase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACG CGCCCGACCT GACGAACCTC GTCGACCTCG CCGCCGCCCG GTTCGGTGGG 
TCCGTCGTCG CCGTCAATGA CGAGTTCTTC GCGCTCGCCG AGCGCATGCT CCTGGCGGAG
GCACCGATAG TCCGTCCGGG GGTGTTCACC GAGCGCGGCC AGTGGACCGA CGGCTGGGAG
ACCCGCCGCC GCCGCGACCT GCCCGCGGCG GACTGGGCGA TCGTCCGCCT CGGCGCCCCC
GGCATCGTGC ACGCGGTGAC CGTGGACACG ACCCATTTCA CCGGCAACGC CCCGGAGTCG
GTCGAGCTGC ACGGCGCCAC CCTTGCCGGC TACCCGTCCG CCGAGGATGT CGCCGAGGAC
TCGGTCACCT GGGTGCCCCT CGTCGCCCGC ACGCCAGTCG CCGCGGACGC CGTCAACGTC
CTGCCCGTGT CGGCGGAGGG CCGGCTCCGG ATCAGTCACC TGCGGCTGAC CATCCACCCG
GACGGGGGAG TCGCCCGGCT GCGGGTGCAC GGCGTCGTCG TGCCCGACCC GCGGCTGCTC
GACCGGGTCA CCTCCGACCT GGCCGCCGCC TACCTGGGCG GTGTCGTCGT CGCCGCCAGC
GACATGCACT ACGGCGACCG GCACAACCTG AACGCCTCCG GCGACGCCCG GGCGATGGGG
GAGGGCTGGG AGACCCGCCG CCGCCGCGGG CCGGGGCACG ACTGGGCGGT GGTGCGGCTG
GCCACCGAGG GGACCGTGGT GCGGGCCGAG GTCGACACCC GCCACTTCCG CGGCAACGCG
CCGCGCGCCG TCGCGCTGTG GGCGGCGAAC GCACCCACGA CCGGCGACGA CGACCTGGCC
ACGGACGCCC TGTCGGCGAT CACCGACTGG CGGCCGCTGC TGCCGCGCAC CCGGGTCCAG
CCGAACACCC GGCACCTGTT CGACCTGGAG GTGCCGGTCG AGGCCACGCA CGTGCGGGTG
GACGCGATCC CGGACGGCGG TCTCGCCCGG CTGCGGCTGG TCGGCGCGCC GACCGAGGCG
GGACGCGAGT CGCTGGCCAT GCGCTGGTTC GACTCCCTGA CGCCCGCCGC GGCCCGTGAG
GAGCTGCTGG CCTGCTGCGG TTCCGAGGAC TGGGCGGACG CCGTCACGGC CCTCCGGCCC
TTCGACACGC TCGCCACGCT GCTGCCCGCC GCCGAGCAGG AGTGGTGGAA CCTGCCCGAA
ACCGCCTGGC TGGAGGCGTT CACCGCGCAT CCCCGGATCG GGGAGCGCCC GACACCCGCG
CCGACGCCGC CGACGTCCTC GCGGGCCACC GTCGTCTCCC TCGACGCCCC CCGACGGGAG
CAGGCCGCCC TGGACCAGGC GAGCGAGGAT GTCCGCGCGG CCTTCGCCGA GGGGAACGCC
GAGTACGAGG ACCGCTTCGG GTTCATCTTC CTCGTTCGCG CGGCGGGGCG TGGCGCGCAG
GAGATGCTGG AGCTGCTCCG GGAGAGGATG GCGAACGACC CCAGGACCGA GCTGCGGGTG
GCGGCCGGCC AGCAGGCGGA GATCACCGCG CTGCGGCTGC GCCATCTGAT CACCGGCGCC
TGA
 
Protein sequence
MTDAPDLTNL VDLAAARFGG SVVAVNDEFF ALAERMLLAE APIVRPGVFT ERGQWTDGWE 
TRRRRDLPAA DWAIVRLGAP GIVHAVTVDT THFTGNAPES VELHGATLAG YPSAEDVAED
SVTWVPLVAR TPVAADAVNV LPVSAEGRLR ISHLRLTIHP DGGVARLRVH GVVVPDPRLL
DRVTSDLAAA YLGGVVVAAS DMHYGDRHNL NASGDARAMG EGWETRRRRG PGHDWAVVRL
ATEGTVVRAE VDTRHFRGNA PRAVALWAAN APTTGDDDLA TDALSAITDW RPLLPRTRVQ
PNTRHLFDLE VPVEATHVRV DAIPDGGLAR LRLVGAPTEA GRESLAMRWF DSLTPAAARE
ELLACCGSED WADAVTALRP FDTLATLLPA AEQEWWNLPE TAWLEAFTAH PRIGERPTPA
PTPPTSSRAT VVSLDAPRRE QAALDQASED VRAAFAEGNA EYEDRFGFIF LVRAAGRGAQ
EMLELLRERM ANDPRTELRV AAGQQAEITA LRLRHLITGA