Gene Cag_1790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1790 
Symbol 
ID3747210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2311272 
End bp2312963 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content49% 
IMG OID637774328 
Producthypothetical protein 
Protein accessionYP_380084 
Protein GI78189746 
COG category[S] Function unknown 
COG ID[COG2989] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATAC TTACCCGTTC CATGGTGCTA CAGCGTCAAG CATCGCCGTA TCAGTTTCGT 
TTTGCGTTGA TTCGATGTAT TGCGGTATTG ATGCTTTGTG CACCAATTTC TCTCTATGCG
GCTGAGGAGG TGCAGCAAGC GGAATCAGCA GCGTGGAAGC GCGATGTAGC ATTGCGTCTT
GAGAAATATT GTATTACAGT TTTTCGCTCG CCCGGTAGCG GTAAAACAAG AGAGAACAAC
TTGCGCGTTG CTCGCTTTTA TGCCACTCGT AGTTACCAGC CACTTTGGAG TAGCACCACT
ATGACGCAAG AGCTTGCCAC ATCACTTAAT GCCGCATTTG AACATGGTTT AACGCCTGCC
GAATACGATG TTGCGGGTGA ACTTCCTCGT TGGATGGCGC TTACCAACCG CTCTGCTGCC
GCGCAAGCTC GTTACGATGT GCTTGCCACG CGTGCGTTTC TTACGCTTGC CACGCACTTA
CGCTACGGCA AACTCGACCC CGTGCGCTTT GAACCAACAT GGAATTTTTC GTCGCCACCA
AATCTCTTTC ATTTTGATGA ACTGTTAGCA CGCACCTTGC AGCGCACCTC TCCAAGTGAA
GTGCTCAATG GCTTGCTCCC GCGTGATCCG GGGTATGATG TATTGAAAAA AGAGCTGGCA
CGTTACCGAG AAATAGCAAA AAATGGGGGA TGGTCCGCCA TTCCTGCGGG AACGCTTTTG
CAGGAAGGAA GCCGTGATGC TCGTGTGCCG CTATTGCGCC AACGCCTTGC TGCTTCGGGC
GATATAAGCT CAAGTGCGGT AGCTGATACC ACAACCCTAT ACAACCCTGA TGTAACAAAA
GCCGTAAAGC GCTTTCAGCA ACAGCATGGT TTATGGAGTG ATGGAGTTGT TGGTGCTACC
ACCTTACGCG CCATCAATGT AAGTGCAGAT GAACGAATTG GGCAATTGCG GGTTAATTTG
GAGCGTTGCC GCTGGCTTTT GCATGATATT TCACCAACCT CCGTAATTGT TAACATTCCA
GCATACACGT TGCACTATTT TGAGCAAGGC GATCGCCGCT GGAGTACTCG CGTCATTGTG
GGGCAGCCCA AACGACCTAC ACCGGTGTTT CGTGCCGATA TGCAATTGCT TATTCTTAAC
CCCCGCTGGG TAGTGCCCTC AACCGTTTTG GCAAAGGATG TGCTGCCCGC AGTTATCAAA
GATCCTGCAT ATCTCCGCAA AAAAAAATTA CGAGTTGTTG ATGAAAATGG TACCATTATT
GATCCAGCAA CCATTAAATG GTCAAGCTAT TCAGCCAGCA CCTTACCATA CCGTTTACAG
CAAAAATCGG GGGATGATGG GGCGCTTGGA CGCATTAAAT TCCTTATGCC CAACCGCTAC
ACCATCTATT TGCACGACAC TCCCGATAAA GCCTTGTTCC AAAAAACACA ACGCGCTTTT
AGCTCAGGCT GTATTCGTGT GCAACACCCC GAAGAACTTG CTCGTCTTGT GCTTCGCCAT
AGCAATCGAG AAAGTCGTCC CTCTCTTGAA AGCCGCATTA AAAGTGGTGC AACATCAACC
ATTCGCCTTC CGCAACAAAT TCCCGTCTAT TTAATTTACC TGACGGCACT ACCCTGCAAC
AACAAAGCTG AATTTCGAGA AGATATTTAT CATCGCGATC CTCAAATTCT TAAAGCGTTA
GACGCGAAGT AG
 
Protein sequence
MAILTRSMVL QRQASPYQFR FALIRCIAVL MLCAPISLYA AEEVQQAESA AWKRDVALRL 
EKYCITVFRS PGSGKTRENN LRVARFYATR SYQPLWSSTT MTQELATSLN AAFEHGLTPA
EYDVAGELPR WMALTNRSAA AQARYDVLAT RAFLTLATHL RYGKLDPVRF EPTWNFSSPP
NLFHFDELLA RTLQRTSPSE VLNGLLPRDP GYDVLKKELA RYREIAKNGG WSAIPAGTLL
QEGSRDARVP LLRQRLAASG DISSSAVADT TTLYNPDVTK AVKRFQQQHG LWSDGVVGAT
TLRAINVSAD ERIGQLRVNL ERCRWLLHDI SPTSVIVNIP AYTLHYFEQG DRRWSTRVIV
GQPKRPTPVF RADMQLLILN PRWVVPSTVL AKDVLPAVIK DPAYLRKKKL RVVDENGTII
DPATIKWSSY SASTLPYRLQ QKSGDDGALG RIKFLMPNRY TIYLHDTPDK ALFQKTQRAF
SSGCIRVQHP EELARLVLRH SNRESRPSLE SRIKSGATST IRLPQQIPVY LIYLTALPCN
NKAEFREDIY HRDPQILKAL DAK