Gene Cag_1473 details

Gene Information

Locus tag: Cag_1473
Symbol:
ID: 3746442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1939060
End bp: 1940211
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 38%
IMG OID: 637774007
Product: hypothetical protein
Protein accession: YP_379772
Protein GI: 78189434
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
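
The start, end, gene length and protein length values in this record can be checked against each other with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are made up for the example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1939060
end_bp = 1940211
gene_length_bp = 1152
protein_length_aa = 383

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is read in triplets; integer division gives the number of codons.
codons = span // 3

# Assumption: the last codon is a stop codon and does not encode a residue.
residues = codons - 1

print(span, codons, residues)  # 1152 384 383
```

Under these assumptions the computed span matches the listed gene length, and the residue count matches the listed protein length.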


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.574733
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATCAA AACCCTCCAT TCTTCTCTTT TCTGAAGATT TTCCACCCAA CTATGGCGGC 
ATTGGACAAT GGGCAATTGG TGTAGCACAA AGCATACATC GTATGGGATA CCCACTACAC
CTTCTCACCC GCTATATGAA TCCTGAAGCT GAGATGCTAC AAAACCGTGA ACCATATCCT
GTAATACAAG TTCATGGTAA ACGATGGAGT CAATTCCGTT CTTTCTATAC CTATAGTGCC
ATAAAAAATA TTTATAAAAA AGGCATAAAA CCAGATATAG TTATTGCCAC AACATGGAAT
ATTGCGCGAG GCATAACAAG GATTCTAAAA AAAAATAAAA CAAAGCTTGT TATAGTTGTT
CACGGCTTAG AAGTTACGCG GACTATGCCG TGGCTAAAAA CACGGTGGCT ACAACAAACC
CTTAATGCAG CAGATGCTGT TATTGCTGTA AGTAATTTCA CTCGCGATCG TGTTATAGAG
CGTTGCAATA TTAATCCATC AAAAGTCCAC TTCCTCCCTA ATGGAGTTGA CCCACAACGT
TTTTTTCCAC GAAGCAACAC AACCCACTTA CAAGAAAAAT ATAATCTACA CAATAAAAAA
GTTATTTTAA CTCTTGCCCG TTTACAAGAA CGAAAAGGAC ACGATAAAGT TATTGAAGCT
TTACCTACTG TTTTAAAAGA AATTCCTAAT GCTCATTACC TTATATCAGG TGCACTAAAA
GGGACATACT ATAAAACATT ACAACAACAA GTAAGCAATT TACGTCTTAA CGAACATGTA
ACCTTTACAG GTTTTGTTGA CTCCGCTGAT TTAAATGCCT TTTATAATGT ATGCGATGTC
TATATTATGC CAAGTCGAGA ACTTGAGAAA AAAGGGGATA CAGAAGGATT TGGCATTACT
TTTTTAGAAG CAAATGCGTG CGAAAAAGCG GTTATTGGCG GACGCTCAGG TGGGGTGGCT
GATGCTATTG ATGATGGTAA AACAGGCTAT CTCGTAAACC CATTAGATAG CAATGAAATT
GCAGAAAAGT TGATTTACTT ACTGAGTAAC CCAGAATTAG CTACACAATT TGGAAAACAA
GGCAGACAAC GCATACTTAC AAGCTACACA TGGGATGCTG TTACTAAAAA ATTGCTCGCA
ACAATAGCAT AA
 
Protein sequence
MASKPSILLF SEDFPPNYGG IGQWAIGVAQ SIHRMGYPLH LLTRYMNPEA EMLQNREPYP 
VIQVHGKRWS QFRSFYTYSA IKNIYKKGIK PDIVIATTWN IARGITRILK KNKTKLVIVV
HGLEVTRTMP WLKTRWLQQT LNAADAVIAV SNFTRDRVIE RCNINPSKVH FLPNGVDPQR
FFPRSNTTHL QEKYNLHNKK VILTLARLQE RKGHDKVIEA LPTVLKEIPN AHYLISGALK
GTYYKTLQQQ VSNLRLNEHV TFTGFVDSAD LNAFYNVCDV YIMPSRELEK KGDTEGFGIT
FLEANACEKA VIGGRSGGVA DAIDDGKTGY LVNPLDSNEI AEKLIYLLSN PELATQFGKQ
GRQRILTSYT WDAVTKKLLA TIA
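
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that step, not part of the original record: `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators and line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied verbatim from the record.
first_line = "ATGGCATCAA AACCCTCCAT TCTTCTCTTT TCTGAAGATT TTCCACCCAA CTATGGCGGC "

cleaned = clean_sequence(first_line)
print(len(cleaned))                    # 60 bases per full line
print(round(gc_fraction(cleaned), 2))  # GC fraction of this line only
```

Pasting the whole gene sequence into `clean_sequence` and passing the result to `gc_fraction` would give a value to compare with the listed GC content, and `len` on the cleaned gene and protein sequences can be compared with the listed lengths.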