Gene Cag_1721 details


Gene Information

Locus tag: Cag_1721
Symbol:
ID: 3746987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2234849
End bp: 2236366
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 32%
IMG OID: 637774258
Product: hypothetical protein
Protein accession: YP_380015
Protein GI: 78189677
COG category:
COG ID:
TIGRFAM ID:
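
The reported lengths can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 2234849
end_bp = 2236366

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and its final codon is a stop codon,
# which does not contribute a residue to the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1518 505
```

Both values agree with the Gene Length and Protein Length fields listed in the record.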


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACCTT TACCCGTAGG CATACAAACG TTTAGTGAAA TCATTAAACA AGATTACTTG 
TATATTGATA AAACAAGTTT AGCTAATGAA TTGATTAAGA GATACAAATA TGTCTTTCTT
TCACGTCCGC GACGCTTTGG TAAGAGTTTG TTTCTTGATA CCCTAAAAAA TATTTTTGAA
GGCAAGCAAG AGCTTTTTAA AGAGTTACTT ATTTACAAAC AATGGAATTG GAATGTAACC
CATCCCGTTA TTAAAATCAG TTTTAGTGGT GGCATACGCG ATAAAGAAAG CCTTCGTGAT
AATCTTTTTT ATATCTTAAA AGATAATCAA GAACGGCTTA ACATAAATTG TGAAGAAAAA
AACAATCAAA ATCTATGTTT TGCAGAACTC ATTAAAAAAG TCTATCAAAA ATATCAACAA
AAAGTAGTTA TTCTCATTGA CGAGTACGAC AAGCCAATTC TCGATAATAT TGAAAATATT
CCTGAAGCTC TCATCGTCCG TGATGGAATG CGAGACTTTT ATAGCAAAAT AAAAGAGAGT
GATGAATATT TACGATTTGT TTTTCTTACA GGAGTAACCA AATTTTCAAA AGTATCGCTT
TTTAGTGGCT TAAACAATCT TGAAGATATT AGCCTCAATC CTGATTTTGG CAATGTTTGT
GGCTACACAC AGCATGATGT TGATACCATT TTTGCTCCCT ATTTTGAAGG TGTAGATATG
GAAGAGGTTA AGCGATGGTA TAATGGCTAT AACTTTCTCG AAGATAAAGT TTACAATCCT
TTTGATATTT TGCTTTTTAT TAAAAATCAA CGGATGTTCA AAAACTATTG GTTTGAGACG
GGAACACCAA GATTTCTCAT TGAGCTAATA AAAAAGAACA ACTACTTTAT TCCTAAGTTA
AACAAACTTA AAGTTAATGA ATCATTAGTC AATAGTTTTA ATCTTGAAAA TCTTAATTTA
GAAACTATTT TATTTCAAGC AGGCTACTTA ACGATTAAGC GATTGTTGCC ATCAGGTATG
GGAGTTGGCT ACGAGTTGGG ATTTCCTAAT AAAGAGGTGC AAATCAGTTT TAATGATTAC
ATCTTACAGG TAATGACCAT TGTTTCAGAT AAAGAGCCGA TTCGCTATGA GCTTTTTGAT
ATTATCAATA ATGGAGATGT TGCAAATTTA GAACCCATCA TCACACGTCT TTTTGCAAGC
ATTGCTTATA ATAATTTTAC CAACAATTAT ATTGAGAGCT ACGAAGGCTT TTATGCGAGC
ATCCTTTATG CTTATTTTGC AAGTCTTGGT TTTGATATTA TTGCTGAGGA TCTAACCAAT
AACGGACGAA TAGATTTAAC TCTTAAAAAT TATGAGAAAA CCTATTTGTT TGAATTTAAG
GTGAGTAATC AAGAACCACT TGAGCAAATC AAGAAAATGA AATATTACGA GAAATATGAT
GGCGAACGTT ATCTCATTGG CATTGTTTTT GATCCAAAGG CGAGAAACGT CAGTCAGTTT
GTATGGGAAA AGGTTTAA
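
A GC content figure such as the one in the Gene Information fields is conventionally the share of G and C bases in the nucleotide sequence. The sketch below shows one way to compute it from a sequence formatted like the listing above, with 10-base groups separated by spaces. The function name `gc_percent` is hypothetical, and the record does not state how the database rounds the value.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence, in the same 10-base grouping as the listing.
first_row = "ATGAAACCTT TACCCGTAGG CATACAAACG TTTAGTGAAA TCATTAAACA AGATTACTTG"
print(round(gc_percent(first_row), 1))
```

To check the full gene, pass all rows of the gene sequence joined into one string.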
 
Protein sequence
MKPLPVGIQT FSEIIKQDYL YIDKTSLANE LIKRYKYVFL SRPRRFGKSL FLDTLKNIFE 
GKQELFKELL IYKQWNWNVT HPVIKISFSG GIRDKESLRD NLFYILKDNQ ERLNINCEEK
NNQNLCFAEL IKKVYQKYQQ KVVILIDEYD KPILDNIENI PEALIVRDGM RDFYSKIKES
DEYLRFVFLT GVTKFSKVSL FSGLNNLEDI SLNPDFGNVC GYTQHDVDTI FAPYFEGVDM
EEVKRWYNGY NFLEDKVYNP FDILLFIKNQ RMFKNYWFET GTPRFLIELI KKNNYFIPKL
NKLKVNESLV NSFNLENLNL ETILFQAGYL TIKRLLPSGM GVGYELGFPN KEVQISFNDY
ILQVMTIVSD KEPIRYELFD IINNGDVANL EPIITRLFAS IAYNNFTNNY IESYEGFYAS
ILYAYFASLG FDIIAEDLTN NGRIDLTLKN YEKTYLFEFK VSNQEPLEQI KKMKYYEKYD
GERYLIGIVF DPKARNVSQF VWEKV