Gene Cag_1275 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1275 
Symbol 
ID3748313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1737991 
End bp1739268 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content41% 
IMG OID637773813 
Producthypothetical protein 
Protein accessionYP_379579 
Protein GI78189241 
COG category[R] General function prediction only 
COG ID[COG3550] Uncharacterized protein related to capsule biosynthesis enzymes 
TIGRFAM ID[TIGR03071] HipA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.370797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGATG TTCAAGTTTC ACACAACAGT GTTGGCACCC TTGCACACCA CACATCGGAG 
CAAGGTAGCT ACACCTTTGC CTACCATAAA GCTATTGATA TTGGGCAGGA AGTATCGCTT
ACTATGCCAT GGTCGCTTGC AAGTTACCAT TACCGCAAAG GGCTTCACCC AATTTTTCAA
ATGAATTTGC CCGAAGGTAG GTTACGCTAC ACCCTTGAGC GTGCTTTCCG TAAACAGGCA
CAAGGATTTG ATGATTTAAT GTTACTTGAT ATTATTGGGC ATTCGCAAAT TGGGCGCTTA
CATTGCACAA GTAACCCTCA GCTCCCTAAA TCGGTGCCAT TGCAAAGTAT TAACGAGCTG
TTAGCTTACA ATGGCACTGA AGATTTGCTT CGTGATTTAC TTGAACGTTT TTCAGCCACA
TCAGGCATTT CAGGCATTCA GCCAAAAGTG CTTATTTGTG ATCCCAATCA AGCAGCATTA
GGTGCAAAAT TTCCAACACA CCATAGTCCA CAGCTCACTA ATGCCCAAGC ACGCATTACC
GTAAAGGGAG CAACACATAT TGTAAAAGGA TGGGATGAAA ATGAGTATCC TCACCTTGCC
TTAAATGAGT GGTTTTGCAT GAAAGCTGCA AAGCAAGCAG GTTTAGAAGT ACCACGTATC
TTTCTCTCCG AAAATTATCA ACTGCTTATT CTGGAGCGTT TCGACCTTTT AGAAGATGGA
ACCTATCTTG GATTTGAAGA TTTTTGCGCT TTACATGGAT TAAGTACGTT TGAAAAGTAT
GATGGTAGTT ATGAACGCGT AGCGAAACGC ATAACACAAT TTGTAAGCCA AGAGCATCGC
CAAAAAGCAT TCGAAGAGTA TTTCAAAATT GTTGCTCTTT CATGTGCTGT ACGTAACGGC
GACGGGCATC TTAAAAATTT TGGCGTCCTT TATTCCAACA CCACAAGCGA TGTATGGCTC
TCTCCAGCAT ATGATATTGT TTCAACAACC CCCTACATTC CACGAGACTC GTTAGCATTA
ATGTTAGATG GCAGTAAACG TTTTCCTTCT CGAAAAAAAC TCTTGAATTT TGCCCGTCAA
CACTGTAACC TACAACACGA GCAAGCTACC GAAATGATGG AAAAGATAGG TGATGCCGTT
AATGAAACAA TGGCTGAAAT AAAAGTACAG ATAAAGGAGT ATTCTCCATT CGCATCAATC
GGCAATAGAA TGCTTAGCAC ATGGAATGAA GGAATAATAG ATCTCAATGG GAAATCCACC
ATCTCGTTCT CTACATAA
 
Protein sequence
MLDVQVSHNS VGTLAHHTSE QGSYTFAYHK AIDIGQEVSL TMPWSLASYH YRKGLHPIFQ 
MNLPEGRLRY TLERAFRKQA QGFDDLMLLD IIGHSQIGRL HCTSNPQLPK SVPLQSINEL
LAYNGTEDLL RDLLERFSAT SGISGIQPKV LICDPNQAAL GAKFPTHHSP QLTNAQARIT
VKGATHIVKG WDENEYPHLA LNEWFCMKAA KQAGLEVPRI FLSENYQLLI LERFDLLEDG
TYLGFEDFCA LHGLSTFEKY DGSYERVAKR ITQFVSQEHR QKAFEEYFKI VALSCAVRNG
DGHLKNFGVL YSNTTSDVWL SPAYDIVSTT PYIPRDSLAL MLDGSKRFPS RKKLLNFARQ
HCNLQHEQAT EMMEKIGDAV NETMAEIKVQ IKEYSPFASI GNRMLSTWNE GIIDLNGKST
ISFST