Gene Cfla_0208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCfla_0208 
Symbol 
ID9144074 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCellulomonas flavigena DSM 20109 
KingdomBacteria 
Replicon accessionNC_014151 
Strand
Start bp245637 
End bp248753 
Gene Length3117 bp 
Protein Length1038 aa 
Translation table11 
GC content74% 
IMG OID 
ProductNADH/Ubiquinone/plastoquinone (complex I) 
Protein accessionYP_003635326 
Protein GI296128076 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGCTGC TGCTGATCCT CCACCTCGCA GCAGCCCTCG TCGCGCCCCT GCTGTTCACG 
TGGCTCGGCC GGCGCGCGTT CTGGACGCTC GCGCTGGCGC CCGCCTCCGC GGCCGTGTGG
GCGCTGACGC AGACCGCCGA GGTGCAGGCC GGCGACGGCC CGGTCCAGGT GGTCGAGTGG
ATCCCCGCGC TCGGCATCGA GCTGAGCTTC CGGCTCGACA CGCTCTCGTG GCTCATGACG
GTGGTCGTCG GCGGTGTCGG CGCGCTCGTG CTGGTGTACT GCGCGGCGTA CTTCTCGCCG
ACCGCGTCGG GCCTCGGGCG GTTCGGCGGC GTCTTCACGG CCTTCGCCGG CTCGATGCTC
GGCCTCGTCA CGGCCGACGA CCTGCTCCTG ATGTTCATGT TCTGGGAGCT GACGACCGTC
ACGTCGTACC TGCTCATCGG CCACTACGCC GACCGCAAGG CGAGCCGGCG CGCCGCCATG
CAGGCGATCA TCGTCACCAC CGCGGGCGGC CTCGCGATGC TCGTCGGTGT CGTCATCCTG
GGCCACGCGG CCGGCACCTA CTCGATGTCC GCGCTCGTGA CGGACCCGCC CGCGGCGTCC
CCCGCGATCA TCGCCGCCGT GGCGTGCCTG CTCGCGGGCG CGGCGACGAA GTCCGCGCTC
ATCCCCTTCC ACTTCTGGCT GCCCGCCGCC ATGGCCGCGC CGACACCCGT CAGCGCGTAC
CTGCACGCCG CCGCGATGGT GAAGGCCGGC GTGTACCTGG TCGCCCGCTT CGCGCCCGCG
TTCGCCGAGC TGCCCGTCTG GCGCTGGACG GTCGTCGTCC TGGGCGGCGG GACGCTGCTG
CTCGGTGGGT ACCGCGCGCT GCGGCAGCAC GACCTCAAGC TCGTGCTCGC GTTCGGCACC
GTCAGCCAGC TCGGGCTGAT CATCCTGCTG GTCGGGCTCG GCACGCAGGC CACCGCGCTC
GCGGGCCTGG CGATGCTCGG CGCGCACGCG ATGTTCAAGG CGGCCCTGTT CCTGGTCGTC
GGCACGGTCG ACGTGGCCTG CGGCACGCGT GACCTGCGGC GGCTGTCGGG TGTGGGCCGC
GCCCTGCCGT GGACGGCGCT CGCGGGCGGC CTGGCGACCG CGTCGATGAT CGGGCTGCCG
CCGTTCGCCG GGTACGTCGC CAAGGAGGCG GGCCTCGAAG CCCTCGTGCA CCTCGAGGAC
GGCACGATCG GGACGGTCGT GCTCACGGTC GTCGTCCTCG GCTCGGCGCT GACCGTCGCG
TACGGCCTGC GCTTCTGGTG GGGCGCGTTC GCGACGAAGC CCGCGCTGGT CCCGCAGTCG
CAGACGCAGG ACGAGCAGGG CGTCGCGACG ACCACGTCGC TGGCCGCGGG CGGCGACGAG
TCCGTCGAGC CCGCCCCGGT CTCCCGCCCG TCGTTCCTGC TCACCGCTCC CGCGCTGGTC
CTGGCGTTCC TCGGCGTCGC GGTCGCGCTG CTCCCGCAGC TCGGCGAGCA CCTGCTCGCG
CCGTACGCCG CGACGTACCC CCTCGGCGAG CCCGGGCACC TCGTGCTGTG GGGCGGGTTC
GGGCTCGTGC TGTGGCTCAC GGTCGGCGTG CTCGGCGTCG GTGGCCTGCT GTTCCTCGTG
CGGGACAGGG CCGAGCGGTG GCAGGCGCGC GCACCGCACG TCTTCGAGGC CGACCTCACG
TACCGCCGTT CGATGCGCAA GCTCGACGAC GTCGCCGCCG ACGTCACCGC CGTCACCCAG
CGTGGTTCGC TGCCCCTGTA CCTCGGCGTC ATCCTCTTCA CCTGGGTCGC CGCCGTGGGC
ACCGCGCTCC TGGCGGGCAC GTCGCTCCAG GTGGAGACAC GCCCCTGGGA CTACGCCGCG
CAGACGATCT TCGCGGCCGC CGCGATCGTC GCCGCAATCC TCGTCGCCCG CGCCCGTCGC
CGGCTCAAGG CCGTCATCCT CGCCGGCATC AGCGGCTACG CCACGGCCGG GATGTTCCTG
CTGTACGGCG CGCCGGACCT CGCGGTCACG CAGGTGCTGG TCGAGACGAT CACGCTCGTC
GTGTTCGTGC TCGTGCTGCG CCGGCTCCCC CCGTACTTCT CCGACCGCCC GCTCGCCGGG
TCGCGGTGGC TGCGCCTCGG CCTGGGCCTC GCGGTCGGCC TCACCGTCGC CGGGGTCGCG
CTCGTCGCGC CGTCGGCGCG CGTGCACGCG CCGGTGTCGA CGGACTTCGC GACCGAGGCC
TACGAGTTCG GCGGCGGCAA GAACATCGTC AACGTCACGC TCGTCGACAT CCGCGCGTGG
GACACCATGG GCGAGCTCTC CGTGCTGCTC GTGGCCGCGA CGGGCGTCGC GTCGCTCGTC
TTCCTGTCGG CACGCGGTGG ACGGATCTTC CGCGAGCGGG AGGCTCCCGC CGACCGCGCG
GTGTGGGGCG GCACCCCCGA CCCGATGGCG GCCCTGCGGC GTCCCGGCAC GGCGCCCGGG
ACCACCGCGC CGACGGGCGC GCAGGCCATC GACGGCGGCG CGCCGGCCAC GCCCTCGCGC
GGCACGGGTC GCGCCGTCGG GTCCCGCGCG CGCGAGTGGC TGCGCGCCGG GCGCACGGTC
GCGCCGCAGC GCCGCTCGGT GATCTTCGAG GTCGTCGTGC GCCTGCTGTT CCACACGATG
ATCGTGTACT CCGCGTTCCT GCTGTTCAGC GGCCACAACC AGCCCGGTGG CGGGTTCGCC
GCAGGGCTCG TCACGGGGAT CGCGCTCGCG GTGCGGTACC TCGCGGGCGG CCGCTACGAG
CTGGGCGAGG CCGCGCCCGT CCAGCCGGGC GTCCTGCTCG GGATGGGTCT GTTCCTGTCC
GCCGGTGTGG GGCTCGCGGC GCTGTTCGTG GGTGGCAACG TGCTGGAGTC CTGGATCCTG
GAGTTCCGGC TGCCGGTCTG GGGCGACGTC AAGCTGGTGA CCAGCCTGTT CTTCGACGTG
GGCGTGTACC TCGTGGTCGT CGGGCTCGTG CTCGACATCC TGCGCAGCGT CGGCGCGGAG
ATCGACCGCC GTGCGGAGAC GGGGGAGGAC GAGGTCGCGG AGCACGCCGT CAGCGGCTAC
GACCACGTCG TCGTCGCGCG CGGGTCCGGG TCGACCGGTG AGGGGACGGC ACCGTGA
 
Protein sequence
MLLLLILHLA AALVAPLLFT WLGRRAFWTL ALAPASAAVW ALTQTAEVQA GDGPVQVVEW 
IPALGIELSF RLDTLSWLMT VVVGGVGALV LVYCAAYFSP TASGLGRFGG VFTAFAGSML
GLVTADDLLL MFMFWELTTV TSYLLIGHYA DRKASRRAAM QAIIVTTAGG LAMLVGVVIL
GHAAGTYSMS ALVTDPPAAS PAIIAAVACL LAGAATKSAL IPFHFWLPAA MAAPTPVSAY
LHAAAMVKAG VYLVARFAPA FAELPVWRWT VVVLGGGTLL LGGYRALRQH DLKLVLAFGT
VSQLGLIILL VGLGTQATAL AGLAMLGAHA MFKAALFLVV GTVDVACGTR DLRRLSGVGR
ALPWTALAGG LATASMIGLP PFAGYVAKEA GLEALVHLED GTIGTVVLTV VVLGSALTVA
YGLRFWWGAF ATKPALVPQS QTQDEQGVAT TTSLAAGGDE SVEPAPVSRP SFLLTAPALV
LAFLGVAVAL LPQLGEHLLA PYAATYPLGE PGHLVLWGGF GLVLWLTVGV LGVGGLLFLV
RDRAERWQAR APHVFEADLT YRRSMRKLDD VAADVTAVTQ RGSLPLYLGV ILFTWVAAVG
TALLAGTSLQ VETRPWDYAA QTIFAAAAIV AAILVARARR RLKAVILAGI SGYATAGMFL
LYGAPDLAVT QVLVETITLV VFVLVLRRLP PYFSDRPLAG SRWLRLGLGL AVGLTVAGVA
LVAPSARVHA PVSTDFATEA YEFGGGKNIV NVTLVDIRAW DTMGELSVLL VAATGVASLV
FLSARGGRIF REREAPADRA VWGGTPDPMA ALRRPGTAPG TTAPTGAQAI DGGAPATPSR
GTGRAVGSRA REWLRAGRTV APQRRSVIFE VVVRLLFHTM IVYSAFLLFS GHNQPGGGFA
AGLVTGIALA VRYLAGGRYE LGEAAPVQPG VLLGMGLFLS AGVGLAALFV GGNVLESWIL
EFRLPVWGDV KLVTSLFFDV GVYLVVVGLV LDILRSVGAE IDRRAETGED EVAEHAVSGY
DHVVVARGSG STGEGTAP