Gene Francci3_0839 details

Gene Information

Locus tag: Francci3_0839
Symbol:
ID: 3905116
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 980608
End bp: 981969
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 72%
IMG OID: 637878172
Product: allantoinase
Protein accession: YP_479952
Protein GI: 86739552
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0044] Dihydroorotase and related cyclic amidohydrolases
TIGRFAM ID: [TIGR00857] dihydroorotase, multifunctional complex type; [TIGR03178] allantoinase
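
The length fields are consistent with the coordinates. The short Python check below is a sketch under two assumptions the record does not state: that the start and end positions are 1-based and inclusive, and that the protein length excludes the final stop codon.

```python
# Values copied from the record.
start_bp = 980608
end_bp = 981969

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1362 453
```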


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGAGC CCACCGCGGA CGTTCTGCCG GAGCGGCCAC AGGCGCTACG GTCACGGCGA 
GTCGTGCTGC CCGGGGGCGA GCGTCCGGCA GCCGTGCACG TCGCGGACGG CCGGATCGCC
GCGATCACCG CCGCGGACGA GATCCCTCCC GGGACCCTGG TCACGGACCT CGGCGAGCTG
GCTCTGCTTC CCGGCGTGGT CGACTCCCAC GTGCACATCA ACGAACCGGG GCGGACCGAG
TGGGAGGGGT TCGCGACCGC GACCCGGGCC GCCGCGGCCG GTGGTGTCAC CACGATCATC
GACATGCCGC TGAATTCCGT TCCGCCGACG ACCTCGCTCG CCGCCCTGGC CGCCAAACGT
GCGGTGGCGG CCGGCCAGGT CGCCGTCGAC GTCGGCTTCT GGGGCGGCAT CATCGGCGCC
GACGCCCGCA ACCTCGATGA TCTGGCCGCC CTGCACGGCG CCGGGGTCTT CGGATTCAAG
GCCTTCCTGG CCCCGTCGGG GGTCGAGGAG TTCCCGCACG TCTCGATGGA CGTCCTCGGG
GCCGCCGCCC GGCGCACCGC CCGGATGGGG GCCCTCACCG TCGTGCACGC CGAGGCGCCC
TCCGTTCTCG CCCTGGCACC TCCCACCGTG GGGCGGGCCT TCGCCAGCTG GTTGGCGTCG
CGCCCGCCGG CCGCGGAGAC CGAGGCCGTG GCCGCGCTGG CGGCGCTGTC CGCCGCCAGC
GGTGCCCGCC TGCACGTGCT GCATCTGGCG GCGGCCGATG CCCTCGACGA CGTGCTCGCG
GCGCGGGACG CGGGGTTGCC GATGACGGTG GAGACGTGCC CGCACTACCT GACCTTCACG
GCCGAGGAGG TCCCCGACGG AGCTACCGTC TTCAAGTGCG CACCTCCCAT CCGGGACAGC
AGGAATCTGG ACCGGTTGTG GGACGGGGTG GCCCAGGGCC TGTTCGCCGG GATCGTCACT
GATCATTCGC CGTCCACTCC CGCGTTGAAG CAGATCGACG CCGGTGATTT CGCGGCGGCG
TGGGGTGGCA TCGCGTCTGT CCAGATCGGG CTACCCGCGG TCTGGACTCA GGCCCGGGCG
CGGGGGCACA CCCTCACCGA CGTCGTCGGT TGGATGTGTG CCGGACCGGC CGACCTGGTA
GGACTGGCCG GCAAGGGGCG CATCGCGATC GGCGCTGACG CCGATCTGGT GATCTTCGAC
GCGGATGCGT CGTTCCTGGT GGAGCCCTCG ATGCTGCGCC ACCGCCACCC GCTCACCCCC
TATGCCGGAC GCGTGTTGAA CGGGGTGGTG CTGGCGACCT ATCTGCGGGG GCGGCGAGCG
GACGGGGATC GTCCTCCGCG GGGGCGGCTA TTAGAGAGGT AG
 
Protein sequence
MNEPTADVLP ERPQALRSRR VVLPGGERPA AVHVADGRIA AITAADEIPP GTLVTDLGEL 
ALLPGVVDSH VHINEPGRTE WEGFATATRA AAAGGVTTII DMPLNSVPPT TSLAALAAKR
AVAAGQVAVD VGFWGGIIGA DARNLDDLAA LHGAGVFGFK AFLAPSGVEE FPHVSMDVLG
AAARRTARMG ALTVVHAEAP SVLALAPPTV GRAFASWLAS RPPAAETEAV AALAALSAAS
GARLHVLHLA AADALDDVLA ARDAGLPMTV ETCPHYLTFT AEEVPDGATV FKCAPPIRDS
RNLDRLWDGV AQGLFAGIVT DHSPSTPALK QIDAGDFAAA WGGIASVQIG LPAVWTQARA
RGHTLTDVVG WMCAGPADLV GLAGKGRIAI GADADLVIFD ADASFLVEPS MLRHRHPLTP
YAGRVLNGVV LATYLRGRRA DGDRPPRGRL LER
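
The blocked sequences can be joined and cross-checked against each other. The following is a minimal Python sketch, not the database's own tooling. The helper names (`unblock`, `translate`, `gc_percent`) are hypothetical, and the codon table uses the standard amino-acid assignments, which translation table 11 shares (the two tables differ only in which codons may act as start codons). Only the first line of the gene sequence is used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first, second and third base each cycle through TCAG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def unblock(text):
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGAACGAGC CCACCGCGGA CGTTCTGCCG GAGCGGCCAC AGGCGCTACG GTCACGGCGA"
dna = unblock(first_line)
print(translate(dna))             # MNEPTADVLPERPQALRSRR
print(round(gc_percent(dna), 1))  # GC percentage of this fragment only
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions would extend the check to the whole record.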