Gene Francci3_3621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3621 
Symbol 
ID3904175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4323575 
End bp4324972 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content72% 
IMG OID637880942 
Productisopropylmalate isomerase large subunit 
Protein accessionYP_482702 
Protein GI86742302 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0065] 3-isopropylmalate dehydratase large subunit 
TIGRFAM ID[TIGR00170] 3-isopropylmalate dehydratase, large subunit 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.766514 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCGCA CACTTGCGGA AAAGGTCTGG GACGCCCACG TGGTGCGGCG CGCCGACGGT 
GAACCGGATC TGCTGTACAT CGATCTGCAC CTGGTTCACG AGGTCACCTC GCCGCAGGCG
TTCGAGGCCC TGCGGCTGGC CGGGCGGCCC GTGCGGCGTC CGGAGCTGAC CCTCGCTACC
GAGGATCACA ACGTCCCCAC CACCGACACG CTGGCGCCGA TCGCCGATCC GATCTCGGCG
GCCCAGGTGG AGGCGCTGCG GAAGAACTGC GCCGAGTTCG GCGTGCGGCT GTACCCGATG
AACGACCCGG GCCAGGGCAT CGTGCACGTC GTCGGCCCCC AGCTCGGGCT GTCCCAGCCC
GGCATGACGA TCGTCTGCGG TGACAGCCAC ACCTCCACCC ATGGCGCGTT CGGGGCGCTG
GCCTTCGGGA TCGGCACCAG CCAGGTCGAG CACGTCCTCG CGACCCAGAC GCTGCCGCAG
CGCAGGCCGA AGACGATGGC GATCACCGTG GCGGGCGACC TGCCCGTCGG GGTCAGCGCG
AAGGATCTCA TCCTGGCGAT CATTGCGCGG ATCGGTACCG GCGGTGGTGC CGGCCACGTC
ATCGAGTACC GCGGTGCGGC GATCCGGGCC CTGTCGATGG AGGGCCGGAT GACGGTCTGC
AACATGTCCA TCGAGGCCGG CGCGCGCGCC GGGATGATTG CGCCCGACGA CGTCACGTTC
GAGTATCTCG CCGGGCGGCC GCGTGTCGCC ACCGGTGCTG CCTGGGAGGA AGCGGTGGCC
TACTGGCGCA CCCTCGCCTC CGACTCCGAC GCGGTCTTCG ACCGGGAGGT CGTGATCGAT
GCCGCGAGCC TCACGCCCTA CGTCACCTGG GGAACCAACC CGGGCCAGGC CGCACCGCTC
GGATCGCTGG TTCCCGCGCC CGCCGACTAC CCGGACGCGG CCGCGCGGGC CTCGGTCGAA
CGAGCGCTGA CCTACATGGG CCTCACCCCC GGCACCCCGT TGTCCGACGT CACCGTCGAC
ACGGTGTTCA TCGGATCGTG CACCAACGGG CGCCTGAGTG ACCTGCGCGC CGCCGCCGAC
GTGCTGCGCG GCCGGCGGGT GAGCGAGGGG GTCCGGGTCC TGATCGTTCC CGGCTCCATG
GCGGTGAAGG CGCAGGCCGA GGCGGAGGGG CTCGACGAGG TCTTCCGAGC GGCGGGAGCG
CAGTGGCGCA GCGCCGGCTG TTCGATGTGC CTCGGCATGA ACCCCGACAC GCTTCGGCCC
GGCGAGCGCA GTGCCTCGAC GTCGAACCGC AACTTCGAGG GCCGGCAGGG GCCGGGTGGG
CGCACCCATC TCGTCTCGCC CGCGGTCGCC GCGGCCACCG CCGTGACCGG TCGGCTGACC
GCTCCGGCGG ATCTGTAG
 
Protein sequence
MGRTLAEKVW DAHVVRRADG EPDLLYIDLH LVHEVTSPQA FEALRLAGRP VRRPELTLAT 
EDHNVPTTDT LAPIADPISA AQVEALRKNC AEFGVRLYPM NDPGQGIVHV VGPQLGLSQP
GMTIVCGDSH TSTHGAFGAL AFGIGTSQVE HVLATQTLPQ RRPKTMAITV AGDLPVGVSA
KDLILAIIAR IGTGGGAGHV IEYRGAAIRA LSMEGRMTVC NMSIEAGARA GMIAPDDVTF
EYLAGRPRVA TGAAWEEAVA YWRTLASDSD AVFDREVVID AASLTPYVTW GTNPGQAAPL
GSLVPAPADY PDAAARASVE RALTYMGLTP GTPLSDVTVD TVFIGSCTNG RLSDLRAAAD
VLRGRRVSEG VRVLIVPGSM AVKAQAEAEG LDEVFRAAGA QWRSAGCSMC LGMNPDTLRP
GERSASTSNR NFEGRQGPGG RTHLVSPAVA AATAVTGRLT APADL