Gene Cag_1606 details


Gene Information

Locus tag: Cag_1606
Symbol:
ID: 3746471
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2096888
End bp: 2098261
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 49%
IMG OID: 637774146
Product: hypothetical protein
Protein accession: YP_379904
Protein GI: 78189566
COG category:
COG ID:
TIGRFAM ID:
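
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2096888, 2098261)
print(length)                  # 1374
print(protein_length(length))  # 457
```

Under those assumptions both results agree with the reported Gene Length (1374 bp) and Protein Length (457 aa).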


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.372312
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTACAA GTGGCGCAAT TGCTAGCCTT GATGCTTTTC TTCATCGTTG GAAAGCTAAA 
GCGGGCAATT ACACGGGCGA TTACATTACA ACGCCTGAAG GGTTAGTTCG CAATAATATG
GATGATGAGC AAGGGCGCGG TGGCTACTAT CAGGAGTACG CCTGCACGTC GGAGAGCCAA
GTGATGATGG CGCGTGGTTA CCTTCGTGCC TATCAAGCAA CGGGCGAAAG CCGTTATTTG
CAAAATGCTC GCACGGCAAT GCAGGCGCTG ATTCGCTACT TCTTTTTCGG CAAAGTTCCC
TCCACTGCAA CGGCGTGGCG TTCGCATTGG ATTGTTAATG CGGGTGCTCC CTTTAAATCA
AAAGAGAATG GGCGCACAAC CGATACCATT GCTTTTGGTG AGGCATACGA GTGCTGGCCT
ACTTGGCGCA AGCTTCGCCC CAATGAATTT GCCACTGCTG GCGATTCCAT GCACTGGTTT
ATTGAGAACT TCCACCTCTT TTCGCAGCTT GAAACAGAGG ATCAAAAAGG GCAATGGCTT
GCAGCACGCG ATGCCATGTT TCGTGAATTT AAGCTGCTGC TCTCCCCAAA ATGGCAAGCG
AAGTACAAAG GAGCTATTCC TTTTGAATAC ACCAATAAGG GGGATAATCT AACTGTGCGT
TCCACCTCAA TTTTTAGAGG ACCCTATTAC ACGGGATACC AAAATCCCCT ACCATGGCTC
TACATGCAGG ATTACACCGC AGCGGCAAAC ATGCTGCAAC TCTTGGTGGA GTCGCAAGTG
GCTTATACCA AAAGCACGGG GGTAAAAGGT CCTTTTGCAC CAGTTTACCA TTACGATGCT
TCGCTGCTTG GTTCCGCAAA GAAAAATGTT TTTACATGGA ATGGACCTGA TCCCAACACC
TTTTGGGGTG GTTTTCAATA TCGTCCCTTT GCCGATGTTG CTCATTTTTG GTACCACTGT
AAGCGCTCCA ATATTCAAAA TGCCGCGGTT AGCAATGCCT CAAAAGTGTG TATGAGCTTT
TTAAGCTGGT TGGATGGCTG GCTTACCGCT CACCCCAATA ATGAATATGT ACCTACCGAA
TTCCGCGAAG CTACACAGCC CAGCGCACCA CCTGCCAATG GCGATAACGA TCCCCACATG
ATTGCCCTTG CGCTGAAAGG GGCGCTCTTT TGCAAAAATG CGGGTGCTGA TGCCGCAATG
GTTGGGCGCG TTATTGCGCG CTTGTACGCT ATGGTGATGA AGCGCCAAAG CAAAGCGGGC
GATATGGCTG GAGCTTTTAT GCACGATCCC TACAGCCATA TTTTTAAAGG ATTTTGGGCT
GGTGATATTA TGGAGGCTTT AGCGCTCTAC ATAATGCACC ACGAAAAAGG ATAA
 
Protein sequence
MSTSGAIASL DAFLHRWKAK AGNYTGDYIT TPEGLVRNNM DDEQGRGGYY QEYACTSESQ 
VMMARGYLRA YQATGESRYL QNARTAMQAL IRYFFFGKVP STATAWRSHW IVNAGAPFKS
KENGRTTDTI AFGEAYECWP TWRKLRPNEF ATAGDSMHWF IENFHLFSQL ETEDQKGQWL
AARDAMFREF KLLLSPKWQA KYKGAIPFEY TNKGDNLTVR STSIFRGPYY TGYQNPLPWL
YMQDYTAAAN MLQLLVESQV AYTKSTGVKG PFAPVYHYDA SLLGSAKKNV FTWNGPDPNT
FWGGFQYRPF ADVAHFWYHC KRSNIQNAAV SNASKVCMSF LSWLDGWLTA HPNNEYVPTE
FREATQPSAP PANGDNDPHM IALALKGALF CKNAGADAAM VGRVIARLYA MVMKRQSKAG
DMAGAFMHDP YSHIFKGFWA GDIMEALALY IMHHEKG
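
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how they could be processed, the helpers below strip that layout, compute a GC fraction, and translate a coding sequence up to the first stop codon. The helper names are hypothetical. The codon table is built by hand, on the assumption that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence above, trimmed to whole codons.
prefix = clean("ATGAGTACAA GTGGCGCAAT")[:18]
print(translate(prefix))  # MSTSGA
```

Passing the full gene sequence through `clean` and then `translate` or `gc_fraction` gives a way to compare against the Protein sequence and GC content fields of the record.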