Gene Cag_0447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_0447 
Symbol 
ID3747372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp524280 
End bp525518 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content49% 
IMG OID637772980 
Producthypothetical protein 
Protein accessionYP_378763 
Protein GI78188425 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.607814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTACC TTTTTGTTCA CCAAAATTTT CCCGGGCAGT TCAAATTTCT TGCCCCAACA 
TTAGCCGCTA ATAAGAGCAA CAAAGTGGTA GCGCTTTGCA TGAAGCCGCA AGCGCCTACC
ATATGGCAAG GCGTTGAGGT TCGTAGCTAT AGCGCCAATC GAGGCACAAC AAAAGGGGTG
CATCCGTGGG TGAGCGATTT TGAAACCAAA ACCATTCGGG CGGAAGCCTG CTTTATGGCA
GCGCAACAGC TTAAAGCTGA AGGCTTTACG CCCGACGTTA TTATTGCGCA TCCCGGCTGG
GGCGAAAGCA TGTTTTTAAA AGAGGTATGG CCTCATGCAA AGCTTGGCAT CTATTGCGAA
TTTTACTACC ACCCCGAAGG TGCTGATGTT GGCTTTGATC CTGAATTTCC ACCGAAAAGT
GAGAGCGACC GTTGCCGCTT GCGGTTAAAA AACCTCAACA ACATTGTACA CTTTCAAATT
GCCGATGCGG GACTTTCACC AACTCATTGG CAGGCAAGTA CTTTTCCCGA ACCATTTCGC
TCCCGCATTA CCGTTGCCCA CGATGGCATT GATACCACGC TGCTCTCTTC AAACGTAGCA
GTGCGCTTAA CGCTCAATAA TAGCCTGACA CTCACTCGTA AGGATGAAGT TATCACCTTT
GTAAATCGCA ACTTAGAGCC ATATCGAGGC TACCACGTTT TTATGCGAGC ACTGCCCGAA
CTGCTGCAAC AGCGCCCAAA TGCACGGGTG CTTCTTGTGG GGGGCGACAA GGTCAGTTAC
GGCGCTAAGC CTGAGGGAGA AGAAAGCTGG AAGGAGCACT TTATTGCCGA AGTGCGTCCA
CGCATCAGCG ATGCCGATTG GGCACGAGTT CACTTTCTTG GAACTATTCC CTACAACATT
TTTGTTCAGT TGCTCCAACT CTCCACCGTA CACATTTATC TCACCTATCC CTTTGTGCTT
TCATGGAGTT TGCTTGAAGC CATGAGCATT GGGTGCGCCA TTGTTGCCAG CAACACCAAG
CCGCTGCTTG AAGCCATTCA CCACAATGAA ACAGGGCAAC TTGTTGATTT TTTTGATGAA
AAAGGATTGG TTGAGAACAT TTGCGAGTTG CTTGATAACA CCAATGAACG CGCACGGCTT
GGCGCTAATG CCCGACGCTT TGCGCAAGCC ACCTACGACT TACGCACCAT CTGTTTACCG
CAACAGCTTG CATGGGTTGA GAGCTTGAGC AAAAAATAG
 
Protein sequence
MRYLFVHQNF PGQFKFLAPT LAANKSNKVV ALCMKPQAPT IWQGVEVRSY SANRGTTKGV 
HPWVSDFETK TIRAEACFMA AQQLKAEGFT PDVIIAHPGW GESMFLKEVW PHAKLGIYCE
FYYHPEGADV GFDPEFPPKS ESDRCRLRLK NLNNIVHFQI ADAGLSPTHW QASTFPEPFR
SRITVAHDGI DTTLLSSNVA VRLTLNNSLT LTRKDEVITF VNRNLEPYRG YHVFMRALPE
LLQQRPNARV LLVGGDKVSY GAKPEGEESW KEHFIAEVRP RISDADWARV HFLGTIPYNI
FVQLLQLSTV HIYLTYPFVL SWSLLEAMSI GCAIVASNTK PLLEAIHHNE TGQLVDFFDE
KGLVENICEL LDNTNERARL GANARRFAQA TYDLRTICLP QQLAWVESLS KK