Gene Cag_1059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1059 
Symbol 
ID3747042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1440094 
End bp1441092 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content46% 
IMG OID637773590 
Producthypothetical protein 
Protein accessionYP_379364 
Protein GI78189026 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0702] Predicted nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000180564 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATAACA AAAAGCGGGT GTTTGTTGTT GGTTCAACGG GCTACATTGG TAAGTTTGTG 
GTGCGCGAGT TGGTGGCGCG AGGTTACCAT GTGGTGAGTT TTGCTCGTGA GCGTTCGGGG
GTTGGTGCTG CCACAACGGC TGAGCAGCTT CGGCAAGATT TAAAGGGTTC GGAGGTGCGT
TTTGGGGATG TGGGCAACAT GCAATCGTTG CGTGCCAATG GTATTCGGGG TGAGCATTTT
GATGTGGTTG TCTCTTGCTT AACCTCGCGC AATGGAGGCA TTCAGGATTC GTGGAATATT
GATTATCAAG CAACGCGCAA TGCGCTTGAT GCCGCTAAAG CGGCTGGTGC AACGCAGTTT
GTGCTGCTTT CGGCAATTTG TGTGCAAAAG CCTATGCTGG AGTTTCAGCG GGCAAAGCTG
AAGTTTGAGC GTGAGTTGCA GGAATCGGGG TTAACGTGGT CAATTGTGCG TCCAACAGCC
TTTTTTAAGT CTATTGCGGG GCAGGTTGAA GCGGTAAAAA ATGGTAAGCC TTTTGTGATG
TTTGGCAATG GTCGTTTAAC GGCATGTAAA CCTATTAGTG AAGCTGATTT GGCGCGTTAC
ATTGTTAATT GCATTGATGA TAGTTCCATG CAGAATAGAA TTTTACCGAT TGGTGGACCT
GGTCCTGCTA TAACGCCGCT TGATCAAGGG ATGATGCTTT TTGAATTGCT GGGTCGTGAG
CCAAAGTTTA AGAAAATGCC CATCCAAATG TTTGATGTTA TTATTCCCGT GCTTGCTTTG
CTTGGTAAAA TTTTTCCGCA GTTTAAGGAA AAGGCGGAGT TTGCACGAAT TGGGAAATAT
TATTGTTCAG AATCAATGCT TGTGCTTGAT CCAAAAACGG GTAACTATAA TGCTGCAATA
ACGCCTTCGT TTGGGAGTGA TACGTTACGT GAGTTTTATG GTCGAGTGTT GAAGGATGGG
TTGAAGGGGC AGGAGTTGGG TGAACATGCA ATGTTTTAA
 
Protein sequence
MDNKKRVFVV GSTGYIGKFV VRELVARGYH VVSFARERSG VGAATTAEQL RQDLKGSEVR 
FGDVGNMQSL RANGIRGEHF DVVVSCLTSR NGGIQDSWNI DYQATRNALD AAKAAGATQF
VLLSAICVQK PMLEFQRAKL KFERELQESG LTWSIVRPTA FFKSIAGQVE AVKNGKPFVM
FGNGRLTACK PISEADLARY IVNCIDDSSM QNRILPIGGP GPAITPLDQG MMLFELLGRE
PKFKKMPIQM FDVIIPVLAL LGKIFPQFKE KAEFARIGKY YCSESMLVLD PKTGNYNAAI
TPSFGSDTLR EFYGRVLKDG LKGQELGEHA MF