Gene Cag_1416 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1416 
Symbol 
ID3747175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1886457 
End bp1887902 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content44% 
IMG OID637773952 
Productalpha amylase domain-containing protein 
Protein accessionYP_379717 
Protein GI78189379 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTCCTT TGGTTTATGA AATCAACACC CGTATTTGGT TACGGCAATT AAGTGAATAT 
TATAATCAAC CTATTACCCT TGGCTCAGTT CCCGATAGCG AGTTCTCATT TTTTGCCCAA
TGTAATTTTG ATATTATTTG GCTCATGGGT GTTTGGCAGC CCAGCAAGTA CAGTAACGCC
ATAGCAACTT CACACCCCAG CTTGCGTAAA GGCTTTTTAG CACACGTTCC CCACGATCTT
AAAGCTGACG ATATTACAGC CTCCCCTTAC TCAATTCCAA CCTACACCAT CAACGATGCA
CTTGGCGGTA ACGATGAGCT GTTAGCCTTT CGCGCACGGC TGCACCGCAT TGGCATTAAG
CTGATGCTTG ATTTTGTGCC AAACCATTTG GCGCTTGATA ATGAGTGGCT GCCAGAGCAT
CCCGAATTTT TTATGCCACT ACGTGAAGAT GAGCATAGTC AAGATCCTAA TGCAGGGTTT
GAATATGTTG CCAACTCCTA CCTTGCCTAC GGCAAAGACC CATACTTTGC ACCATGGACC
GACACGTTGC AGTTAAACTA TGCCAACCTT GCAACGCACG ATATGATGAC CGAAAATCTC
ATGAAAATTG GCGCATTAGC CGATGCCGTT CGCTGCGACG TTGCTATGCT GATTTTAAAA
AGTGTTTTTA ACACCACATG GAGTTCCCTT GGTGGGCAAA TGCACAAAGA ATTTTGGTTC
GACGCTATTT CATCGGTTAA AAAGCGTTAC CACGACATGA TCTTTTTAGC CGAAGCTTAT
TGGAATAAAG AGTGGGAATT GCAAATGCAG GGTTTTGATT TTACCTACGA TAAACCTTAC
TACGATTATG TAACCAATGC ACCTGTAGTG GTTGATAAAC TGTCGGGGCA TTTAAGTGGT
GGATGGGACT ATCAGCAAAA ACTTTGCCGC TTTCTTGAAA ACCACGATGA ACCGAGAAGT
GCCGCTAAAC TTGGTTTAAA CAACCGTGCC GCCGCTGTAG TGCTACTTAC CACACCAGGA
ATGCACCTTA TTCACCAACA ACAAATGGTG GGCTATAAAA AGCAAATGCC CGTCCAACTA
CTTCGCCAAG CGGTAGAGCC TGAAGATGGT GAATTAGCCG CTTTGTACGA ACAACTTTTT
GCTTTACAAA CGCACGAAGT ATTTCAGCAC GGAGGCATTG AATGGCTTGA TCTTAATGTT
TGCCACTACT GCCATTGCTT TGGCTTCCGC CGTTACCATG ATGAAAAAAA TGCCTTTGTT
ATCGTAAACT TTAGTCCATT TGGCATGGAT CTTACCTTTT CTCATGCAGC ACTTGAAAAT
ATGGAAGGCA AAGCGCTCCA CACCCTCAGC TCAACCGGCA AGTTGGCTGA AAATGAGCTT
TCCGTTGAAG GACGTGCAGT TAAAGTAACG CTTTCACCGC ATGAGGCGCT TGTGATGTAC
AATTAA
 
Protein sequence
MFPLVYEINT RIWLRQLSEY YNQPITLGSV PDSEFSFFAQ CNFDIIWLMG VWQPSKYSNA 
IATSHPSLRK GFLAHVPHDL KADDITASPY SIPTYTINDA LGGNDELLAF RARLHRIGIK
LMLDFVPNHL ALDNEWLPEH PEFFMPLRED EHSQDPNAGF EYVANSYLAY GKDPYFAPWT
DTLQLNYANL ATHDMMTENL MKIGALADAV RCDVAMLILK SVFNTTWSSL GGQMHKEFWF
DAISSVKKRY HDMIFLAEAY WNKEWELQMQ GFDFTYDKPY YDYVTNAPVV VDKLSGHLSG
GWDYQQKLCR FLENHDEPRS AAKLGLNNRA AAVVLLTTPG MHLIHQQQMV GYKKQMPVQL
LRQAVEPEDG ELAALYEQLF ALQTHEVFQH GGIEWLDLNV CHYCHCFGFR RYHDEKNAFV
IVNFSPFGMD LTFSHAALEN MEGKALHTLS STGKLAENEL SVEGRAVKVT LSPHEALVMY
N