Gene Cag_1737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1737 
Symbol 
ID3746520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2256164 
End bp2257660 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content46% 
IMG OID637774274 
Producthypothetical protein 
Protein accessionYP_380031 
Protein GI78189693 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTATTA ATAAGGATTT GTTGGAATTT TTCGGTTCCA TGATGGAAAT CAAAAAGCAG 
AAGCGAGACA TTTTTAATGC TCTTGCGGCT GATGTTGATG ATCCCGAAAT CAGAAATACG
CTGCTTCGTA TTGGTGCTGA TGAACAACGT CATGTTGATC AAATTCAGCA AAGCATAAAC
TTAGTGAACA GCGGTTCTAC GGCAGAGCCT ATGGTTCCCG AAGCTGCACC AGCGCCACAA
GTTGCGCCAG CTCCGGCTCC ACCTGCTCCA ACAATAGCGC CTGCTACATT GCAGCCTGCT
ATTGCAATTG CTCAGCCAGC TATAGTGCGC CCCGAGCCAC CTCAGCCCGT TGCACCAATG
CCTGTTCCCG AGCCAGTAGC AGCGCCGCCA ACGTACGTGC AACCTGTGCA ACCTATTGCT
CAGCCAATTA CCCAACAAGT GGTGGTTTCG GAACCAGTAG CACCTACAGT TAGCCAATTG
CAACAGCCAG CACCACCGCA GCCACTTACC TATTTAACAC CAACAATAGC TACACCAGCA
GCTCCTGCTG AGCCCGCTTA CAGTGAGCCT GCTGAACAGT CTATTTCATC ATTTGCCAGC
CCATTATCAT CTGGCACTCA ACGCTATCCG GTTCAGCCAC CCACCTCGAA AACTTTTGAG
AACATGACAA CACTCCACCA TCCATTAGGG GAAGTTTTTG GTTTTGCAGC CACTGATCAA
TCACCAAAAG CTCAGCGTTA TCGCTCACAT CGTCATTGCC CTTTTAACAA TAAGTCGCCA
AACTGCACGA ACTCCCATAC CGAAAATCCT CTTGGTGTAT GTAGTATTTT GCATAATAAC
AAAGCAATTA TTACCTGCCC AATTCGCTTC CGTGAAGATT GGCTTATTAC CGATGATGCA
GCTTCCTTCT TTTTTGAGCC CGGTGTTCGC TGGAGTTCAT TAACCGATGT TCGTTTAGCT
GATGCCAACG GTACTTCCGC TGGTAATATG GATGTTATGT TGGTAGCTTA CGATAAAGAG
GGAAAAATTA TTGATTTTGG TGCTATTCAA ATTCAAACTG CTCACATTGA CGGTAATGTG
CGTGAGCCAT TTGAATGTTA CATGAAAGAT CCTAAGACCA ATGCTATGAT GGATTGGACC
CGTCAGCCAA ACTATCCTGA GCCCGACTTC CTTTCAGCAA TGCGCACCAG CGTTGTGCCT
GAATTGCTTT ACAAAGGTGG TATTTTGCAC TCTTGGAACA AGAAGATGGC AATTGCTATT
AACAAAAGCA TGTTTGAAAC CTTGCCACCA CTAACGCGAG TTAAAAAAGA TGAAGCCGAT
ATTGCGTGGT TGCTTTATGA GCTTGAAGCG GTAAATGACG GTGAAAAAGA GGCTTATCAG
CTTAAGAAAA GTGAAGTTGT TTATACTGCC TTCCAACCTA CCTTATTAGC TCTTACTGCC
ATTGCTCCAG GTAATGTGAA TGACTTTATG AAGTTTATTC CCGAGCTTGG CGCCTAA
 
Protein sequence
MFINKDLLEF FGSMMEIKKQ KRDIFNALAA DVDDPEIRNT LLRIGADEQR HVDQIQQSIN 
LVNSGSTAEP MVPEAAPAPQ VAPAPAPPAP TIAPATLQPA IAIAQPAIVR PEPPQPVAPM
PVPEPVAAPP TYVQPVQPIA QPITQQVVVS EPVAPTVSQL QQPAPPQPLT YLTPTIATPA
APAEPAYSEP AEQSISSFAS PLSSGTQRYP VQPPTSKTFE NMTTLHHPLG EVFGFAATDQ
SPKAQRYRSH RHCPFNNKSP NCTNSHTENP LGVCSILHNN KAIITCPIRF REDWLITDDA
ASFFFEPGVR WSSLTDVRLA DANGTSAGNM DVMLVAYDKE GKIIDFGAIQ IQTAHIDGNV
REPFECYMKD PKTNAMMDWT RQPNYPEPDF LSAMRTSVVP ELLYKGGILH SWNKKMAIAI
NKSMFETLPP LTRVKKDEAD IAWLLYELEA VNDGEKEAYQ LKKSEVVYTA FQPTLLALTA
IAPGNVNDFM KFIPELGA