Gene Ccur_01800 details


Gene Information

Locus tag: Ccur_01800
Symbol:
ID: 8374388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cryptobacterium curtum DSM 15641
Kingdom: Bacteria
Replicon accession: NC_013170
Strand:
Start bp: 219309
End bp: 220736
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 52%
IMG OID: 644993103
Product: aspartyl aminopeptidase
Protein accession: YP_003150593
Protein GI: 256826634
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1362] Aspartyl aminopeptidase
TIGRFAM ID:
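
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 219309
END_BP = 220736
GENE_LENGTH_BP = 1428
PROTEIN_LENGTH_AA = 475


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With the listed values, the span 219309 to 220736 covers 1428 bp, which is 476 codons: 475 residues plus one stop codon.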


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 78
Fosmid unclonability p-value: 0.000552066
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGCAAGA AAAACGCTTG GAAATCGTAC GATGCAGCCG ATGAAGCGCA GCTCAACCAG 
CTTAATAACG ATTACATCGA CTTTATCAGT ACGTGTAAGA CCGAACGCGA ATGCGCCAAC
CGTGCCATTG AGATGGCACG TGAAGCCGGC TATATCAGTC TTGAAGAAGC ACGTATACAG
AAGCGTCCGC TTGCACCGGG CGCCAAGGTG TACGCAAGCG TATACGGCAA AACAGTGATG
CTTGTGCATT TAGGTAAGCG CCCGCTGGAA GAAGGGTTCA ATATCCTTGG TGCGCATATC
GATTCGCCCC GCCTTGATAT CAAGCAGAAT CCCCTGTATG AGGCTCAGGG ATTTGCCCTG
CTTGATACCC ATTATTACGG TGGCATCAAG AAGTACCAGT GGGTTACGCT GCCTTTAGCT
ATCCATGGGG TAGTCGTTAA GCCGAACGGT GAATCTATCA CGGTAAATGT AGGTGAAGAT
GCGACCGACC CGGTCTTTTG CGTTACCGAT CTTCTCATTC ATCTTGGCGC CGATCAGTTA
GAGAAGAAGG GCGGCAAAGT GGTCGAGGGA GAAGACCTCG ATTTATTGGT AGGCAGTCGT
CCGTTTGTGC TTGATAAACA AGACATCAAA GATGCTGCTG AAGGGTCGCT CGAAGAAATG
GCAGCGACCA GCCCGGTGAA AGCGGCTCTG CTTTCTCTCT TTCAGGATAA GTACGGCTTT
ACCGAAGAAG ATTTTCTGTC GGCTGAATTG GAAGTGGTGC CAGCAGGGCG TGCACGCAGC
TGTGGCTTTG ATGCAAGTAT GGTACTGGGC TACGGGCAGG ACGATCGCGT ATGTGCCTAC
ACGAGTCTGA TTGCTCAGCT TGAAGCCGAA GATATACAGC GCACCGCTAT CTGTTTGCTC
GTGGATAAAG AAGAGATTGG CAGTGTGGGT GCAACGGGTA TGACCAGTCT GTTCTTCGAA
AACACTGTTG CCGAAATCAT GGAGTTGGCT GGGCAGGGCG GCGACTTGGC GCGTCGCCGT
GCGCTGGCTG CTTCCGATAT GCTCTCCAGC GACGTGAGTG CAGGGCTCGA CCCGTTATAC
GCCAGTGCCT TTGAAGAAAA GAATGCCGCT CATTTGGGTT GTGGCCTCGT ATTCAACAAG
TTCACCGGTG CCCGCGGGAA AAGCGGCAGC AACGATGCAA ATGCCGAATA CATGGCAAAA
ATTCGTGCCA TTATGGATGG TGGTGCTGTG CGCTTCCAAA CAGCTGAGCT GGGAAAAGTT
GATCAAGGCG GCGGCGGGAC AATCGCCTAT ATCCTGGCAA AATATGGCAT GAACGTTATT
GACTGTGGCG TGGCGGTTCT TTCAATGCAT GCTCCGTGGG AAGTGGCGAG TAAGGCCGAT
ATTTACGAAG CAAAGAAAGG CTATATAGCA TTTTTGCAAC AAGCCTAA
 
Protein sequence
MGKKNAWKSY DAADEAQLNQ LNNDYIDFIS TCKTERECAN RAIEMAREAG YISLEEARIQ 
KRPLAPGAKV YASVYGKTVM LVHLGKRPLE EGFNILGAHI DSPRLDIKQN PLYEAQGFAL
LDTHYYGGIK KYQWVTLPLA IHGVVVKPNG ESITVNVGED ATDPVFCVTD LLIHLGADQL
EKKGGKVVEG EDLDLLVGSR PFVLDKQDIK DAAEGSLEEM AATSPVKAAL LSLFQDKYGF
TEEDFLSAEL EVVPAGRARS CGFDASMVLG YGQDDRVCAY TSLIAQLEAE DIQRTAICLL
VDKEEIGSVG ATGMTSLFFE NTVAEIMELA GQGGDLARRR ALAASDMLSS DVSAGLDPLY
ASAFEEKNAA HLGCGLVFNK FTGARGKSGS NDANAEYMAK IRAIMDGGAV RFQTAELGKV
DQGGGGTIAY ILAKYGMNVI DCGVAVLSMH APWEVASKAD IYEAKKGYIA FLQQA