Gene Aasi_1079 details

Gene Information

Locus tag: Aasi_1079
Symbol:
ID: 6377383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1389821
End bp: 1390954
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 44%
IMG OID: 642682192
Product: hypothetical protein
Protein accession: YP_001958153
Protein GI: 189502436
COG category:
COG ID:
TIGRFAM ID:
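
The listed coordinates and lengths are consistent with one another. The sketch below shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both assumptions are inferred from the values in this record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length):
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the final codon is the stop codon


length = gene_length_bp(1389821, 1390954)
print(length, protein_length_aa(length))  # prints "1134 377"
```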


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.951143
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTATTA GCAAGCTGGT TATAAAAGCA TATAGTAATG AGAGCTTTAC TACCCAAAAG 
GGTGAGTTTT CAGCTTCTAT CAACCCTGCA AACCTCAAAA TTACAAGTAG TGTAGATTAT
GAGAGGTCTC AAGGTATGGG CTCAGCCAAT ATGGCGCTTC GCTATAACGT TTCGCCTCCT
AAGGAACTGT CGTTTAGACT CATCTTTGAT AATACAGGGA TCTTTCCAGA CTCAGACAAA
AGTGTAAAAG ATCAGCTAGA AGCTTTGCAA GACGTGGTAT ATAAGTTCCA GGAAGATATT
AATTCACCTT ATTACGTGCG GGTTATCTGG GGTGTAATTG ATTTTAAAGG TAAATTGGTT
GGTTTGGAGA CAAGTTATAC CATGTTTAAG TCAGATGGTG CTCCAATCCG AGCAGAAGTA
GATATAGTGG TATTAGAAGA TGCAAGCGCA AGCAAGATTG CCACAGCTGC AAAAGCAGCT
GCGAAAACGG CTAATACAGC CACTACTGCA GTATTAGGGG CAGCAACTGG TGCAGCGGCT
GGGGCCGCAG CCGCTGCCGT AACCGTAGCA GCAGCGTCGG TTGCTGTAAG CCCTAATGCA
CCACCTAGTG TAGCGCCTGA TGCTACAACT GCTGGAGCTA CACTCACAGA ATCAGAACTT
GCTGACGCCA GCACACCAGA CACTGCAGGA GCAAAAGCAA ACACTAGTGC AACTGGCACT
TCACAAGCAG GAGGAGAACC AGCAGCAGGA ACAAATGCCG CAGCCACCAC TACCAATCCT
CAAAATGCAG ATACAAAGAA TATAGAGCAG GCACCTGGCG CAACTGCAGC TGCTACCCCT
ACTACGGTAC AACAAGTGAC ACCCAAAGAT AAATTAACTG GTGTTGCTAA AAATTCATTA
GGAGATCCAA ATCTTGCTAA ATCACTAGGC CGTGTAAATG GATTAGACAG CCTTAGGAAC
TTAGCTTCAG GGCTCTCACT AGCTGTTCCT TTAACATCGC TGGGGCTTTT AGCAATGCTA
TTGGCAATGG CAAAAAAATA CGGTTCAAAG GGTGCTAATT ATTTAAAAAG TAAGGCAAAA
ACAGGTAAAA ACAAGGCTGT TGCAGCTAAA GATAAGGTAA AGAGTAAACT TTAA
 
Protein sequence
MSISKLVIKA YSNESFTTQK GEFSASINPA NLKITSSVDY ERSQGMGSAN MALRYNVSPP 
KELSFRLIFD NTGIFPDSDK SVKDQLEALQ DVVYKFQEDI NSPYYVRVIW GVIDFKGKLV
GLETSYTMFK SDGAPIRAEV DIVVLEDASA SKIATAAKAA AKTANTATTA VLGAATGAAA
GAAAAAVTVA AASVAVSPNA PPSVAPDATT AGATLTESEL ADASTPDTAG AKANTSATGT
SQAGGEPAAG TNAAATTTNP QNADTKNIEQ APGATAAATP TTVQQVTPKD KLTGVAKNSL
GDPNLAKSLG RVNGLDSLRN LASGLSLAVP LTSLGLLAML LAMAKKYGSK GANYLKSKAK
TGKNKAVAAK DKVKSKL
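
The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch, not the database's own method: it strips the 10-base block spacing, computes GC content, and translates codons. It relies on the fact that translation table 11 assigns codons to amino acids the same way as the standard code (the two differ in permitted start codons). The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling through TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence, copied from the record above.
first_row = "ATGAGTATTA GCAAGCTGGT TATAAAAGCA TATAGTAATG AGAGCTTTAC TACCCAAAAG"
print(translate(clean_sequence(first_row)))  # prints "MSISKLVIKAYSNESFTTQK"
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` in the same way would allow the GC content and protein length fields to be checked as well.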