Gene Cag_2003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_2003 
Symbol 
ID3747113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2539273 
End bp2540406 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content41% 
IMG OID637774540 
Productfic family protein 
Protein accessionYP_380294 
Protein GI78189956 
COG category[S] Function unknown 
COG ID[COG3177] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0278833 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGTTTG AAGAGTTTAC TGCGGGATAT TGGCAGCAGC GGTATCAATA CAAAAGTTTT 
GAACCCTCCC TCATTAACCA TGAATGGACT TGGGACGAGC CAACCATTAA CACCCTGTTA
GAGCAGGCAA ATTGTGCACT TGGTGAGCTT AACGCCTTCT CATTGATTGT GCCAAATATT
GACCTTTTTA TTCAAATGCA CGTTGTTAAA GAGGCTCAAA CATCAAGCAA AATTGAAGGT
ACGCAAACTG GCATTGATGA GGCATTGTTG TCGGAGGAAC AAATCAGCCC TGAAAAGCGA
GATGATTGGC GAGAGGTGCG TAACTATATT GACGCCGTTA ACAGTGCCAT TACAACATTG
CACGACTTAC CGCTTTCAAA TCGCCTTTTA AAACAAACAC ACAAAATTTT ACTCAGTGGT
GTTCGTGGCG AGCATAAGCT GCCGGGTGAA TTTCGTGTCA GTCAAAACTG GATTGGTGGC
TCTAATTTAA CCGATGCAAG TTTTATTCCT CCGCATCCAG AAAGCGTGGC GGAGTTAATG
AGCGATTTAG AAAAGTTCTG GCATAATCAG GACATTGCAG TACCTCATCT TATTCGCATT
GCGTTAAGCC ATTATCAGTT TGAAACCATC CATCCTTTTC TTGATGGTAA TGGACGCATT
GGCAGATTAT TAATTCCACT TTATTTAGTA AGTCATGGAG TACTTGCAAA ACCGTCGCTC
TATCTTTCCG ACTTTTTTGA ACGTCATCGT TCAAGTTATT ACGATGCCTT AATGCACGTT
CGCACCAGCA ATAACCTTAT TCATTGGTTG AAATTTTTCT TAAACGGAGT TGCACAAACA
GCAACAAAGG GAAGAGATAT TTTTCAGCAA ATTTTAACGC TTAGAGAGGA AGTTGAACAA
GCAGTTTTAA GTTTAGGAAA GCGAGCAACA CTTGCGCGTG AAGCGTTGCA TCTGCTGTAT
CGCCAACCAA TTGTAGAGGC AACTGACTTT TCTACTATGC TTAAAGTGAG TGCTCCAACA
GCAAATGCAC TTATTCAAGC CTTGATTGAT AAAGCTATTC TTGTGGAAAT TACAGGGCAG
CAACGAGGGC GAATTTATTC ATTCGAGCGC TACGTAAAGT TGTTTATGGA GTAG
 
Protein sequence
MKFEEFTAGY WQQRYQYKSF EPSLINHEWT WDEPTINTLL EQANCALGEL NAFSLIVPNI 
DLFIQMHVVK EAQTSSKIEG TQTGIDEALL SEEQISPEKR DDWREVRNYI DAVNSAITTL
HDLPLSNRLL KQTHKILLSG VRGEHKLPGE FRVSQNWIGG SNLTDASFIP PHPESVAELM
SDLEKFWHNQ DIAVPHLIRI ALSHYQFETI HPFLDGNGRI GRLLIPLYLV SHGVLAKPSL
YLSDFFERHR SSYYDALMHV RTSNNLIHWL KFFLNGVAQT ATKGRDIFQQ ILTLREEVEQ
AVLSLGKRAT LAREALHLLY RQPIVEATDF STMLKVSAPT ANALIQALID KAILVEITGQ
QRGRIYSFER YVKLFME