Gene Cag_1440 details


Gene Information

Locus tag: Cag_1440
Symbol:
ID: 3746639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1907038
End bp: 1908237
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 48%
IMG OID: 637773975
Product: aspartate aminotransferase, putative
Protein accession: YP_379740
Protein GI: 78189402
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0436] Aspartate/tyrosine/aromatic aminotransferase
TIGRFAM ID:
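
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the last codon of the CDS is an untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1907038
end_bp = 1908237

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1200 399
```

Both results agree with the Gene Length (1200 bp) and Protein Length (399 aa) fields.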


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGTTA CTGGAGAGCA ATACCTAACA CAGCGTGTGC TGGGAATGCA GGAATCGCAA 
ACCATACGCA TTACCAATCT TGCAGGTAAA ATGAAAGCCG AAGGACTTGA TATTGTCAGC
CTTTCGGCAG GCGAGCCCGA TTTTCCAACG CCACAGCATG TGTGCGATGC GGGCATTGAA
GCTATTCGCG CAGGCTTTAC GCGCTACACC GCAAACTCAG GTATCCCCGA CTTAAAAAAG
GCTATTGTTG CCAAATTCAA ACGCGACAAT GGGCTTGAGT TTGCTGAAAA CCAAATTATA
GTAAGCAACG GTGGCAAGCA AACGCTTGCC AACACCTTTC TTGCCCTTTG CGCCGAAGGT
GATGAAGTAA TTGTGCCAGC TCCATTTTGG GTAAGCTTTC CTGAAATGGT GCGCCTTGCT
GGTGGCACTC CGGTGATTGT TAATACCACC ATCGAAAGTG GCTACAAACT TACGCCCGAT
CAGCTTGAGG CTGCAATTAC GCCAAAAACA AAAATGCTCG TGCTTAATTC ACCCTCAAAT
CCAACGGGTT CCGTCTATAG CGAAGCCGAG GTTCGTGCGC TTATGGCAGT GCTTGAAGGA
CGTAACATTT TTGTGCTCTC CGATGAAATG TACGACATGA TTGTGTACGA TAATGTTCGT
CCATTTTCAC CAGCCTGCAT TCCTGCTATG AAGGATTGGG TGATTGTAAG TAACGGAGTT
TCAAAAGCTT ACTCCATGAC GGGATGGCGC ATTGGCTACC TTGCAGGACC AAAATGGCTT
ATTGACGCGT GCGATAAAAT TCAATCGCAA ACCACCTCCA ACCCCAACTC CATTGCTCAA
AAAGCGGCTG TAGCAGCGCT TAATGGCGAT CAAAGCATGA TTGAAGAGCA TCGGTTAGAG
TTCCAAAAAC GGCGCGATTA CATGTACGAA GCGCTTAACA AAATTCCGGG CTTTAAAACC
ACCTTGCCAC AAGGTGCCTT TTATATTTTC CCTGATATTA GCGGTTTACT TGGTCGCACC
TTTAACGGCG TTGAAATGAA GGATTCGGCT GATGTTGCAG AGTATTTGCT GAAAGTGCAT
TACTTAGCCA CCGTGCCGGG CGATGCCTTT GGCGCTCCTG CAAACTTGCG TTTGTCGTAT
GCTGCATCAA TTGCAGCGCT TGATGAAGCG TTAAATCGTT TGCGGAAGGC GTTTAGCTAA
 
Protein sequence
MAVTGEQYLT QRVLGMQESQ TIRITNLAGK MKAEGLDIVS LSAGEPDFPT PQHVCDAGIE 
AIRAGFTRYT ANSGIPDLKK AIVAKFKRDN GLEFAENQII VSNGGKQTLA NTFLALCAEG
DEVIVPAPFW VSFPEMVRLA GGTPVIVNTT IESGYKLTPD QLEAAITPKT KMLVLNSPSN
PTGSVYSEAE VRALMAVLEG RNIFVLSDEM YDMIVYDNVR PFSPACIPAM KDWVIVSNGV
SKAYSMTGWR IGYLAGPKWL IDACDKIQSQ TTSNPNSIAQ KAAVAALNGD QSMIEEHRLE
FQKRRDYMYE ALNKIPGFKT TLPQGAFYIF PDISGLLGRT FNGVEMKDSA DVAEYLLKVH
YLATVPGDAF GAPANLRLSY AASIAALDEA LNRLRKAFS
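
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ only in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_30 = "ATGGCAGTTA CTGGAGAGCA ATACCTAACA"
print(translate(first_30))  # prints: MAVTGEQYLT
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the Protein Length and GC content fields.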