Gene Cag_1740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1740 
Symbol 
ID3746523 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp2259095 
End bp2260195 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content47% 
IMG OID637774277 
Productaminopeptidase P 
Protein accessionYP_380034 
Protein GI78189696 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTCTC TTACCCTCCA ATTACAGCAC TATCGCCAAA GTAGCTATCA GCACATTGTG 
CAAAAAATGG TGAACCTTGC ACTTGATGCC TTTATAGTTA CAGAACTACC CATTATCCGA
TGGCTTACAG GTTTTAGCGG CTCCTCTGCT CGTTTACTGA TTACGCGCGA AAAGGTGTGG
CTTTTTACCG ACTTCCGCTA TCAAGAGCAA GTACGCCACG AAGTTACCCT TGCCGAAACG
GTTATTGTTG CCGAAGGTTT TATTGCAGAA CTTTTGTTGG GCAACTATCC ATGCGGCACA
ACAATTGCCT TGCAAGCCGA ACACATTACA TGGCAGGAAG CTAATCGTTT ACGCGACAAA
GTGTTTCATG CTCAGCAAGT AATGCCTATT GAAGGTTTTT TTAATGAATT CCGCATAATA
AAGCAGGCAG TAGAACTTGA CTACATGCAA CGCGCTGCGG CTCTTAGCGA AGCGGCACTT
GAAGCGGTGC TTCCCATGAT TTCCCCCAAT GTTACCGAGC TTGATATTGC CGCAGAACTA
AGCTACCAGC AAAAAAAACG AGGTGCTTCA GGCGATTCAT TTTCCCCCAT TGTGGCAAGC
GGAGCACGAG CAGCAATGCC CCACGCAACT CCCACCAACG CCCATTTTGT GCAAGGTGAA
CTTATTTTGC TCGACTTTGG CTGTATGTAT GAAGGCTACG CCTCCGATCA AACCCGCACG
GTGGCACTTG GTAAACCCTC AAAACAAGCA AGTACCATTT ATAACATTGT ACGAAAAGCG
CAGCAACTTG GTTTAGAGCG CGCTCAATGC GGCATGAAAG CACGAAAGCT AGACGAGGTG
GTGCGCCGTT TTATTACCAA ACATGGCTAT GGCGAACAAT TTGGGCACGC ACTTGGGCAC
GGTATTGGGC TTGAAGTACA CGAAGAGCCT CGTATTAGCT CCCGCAGCGA AACCATTTTG
CAAGAGATGA TGCTTTTTAC CATTGAACCG GGCATTTATC TCCCCAATTG CTGTGGGGTT
CGCATTGAAG ATACGGTAGT TATGGGCACA CAAGGGGCTA TGCCGCTTCA GCAATTTAGC
AAAGAACTTA TTGTGCTTTA A
 
Protein sequence
MDSLTLQLQH YRQSSYQHIV QKMVNLALDA FIVTELPIIR WLTGFSGSSA RLLITREKVW 
LFTDFRYQEQ VRHEVTLAET VIVAEGFIAE LLLGNYPCGT TIALQAEHIT WQEANRLRDK
VFHAQQVMPI EGFFNEFRII KQAVELDYMQ RAAALSEAAL EAVLPMISPN VTELDIAAEL
SYQQKKRGAS GDSFSPIVAS GARAAMPHAT PTNAHFVQGE LILLDFGCMY EGYASDQTRT
VALGKPSKQA STIYNIVRKA QQLGLERAQC GMKARKLDEV VRRFITKHGY GEQFGHALGH
GIGLEVHEEP RISSRSETIL QEMMLFTIEP GIYLPNCCGV RIEDTVVMGT QGAMPLQQFS
KELIVL