Gene Aasi_1014 details


Gene Information

Locus tag: Aasi_1014
Symbol:
ID: 6377021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Amoebophilus asiaticus 5a2
Kingdom: Bacteria
Replicon accession: NC_010830
Strand:
Start bp: 1317881
End bp: 1320277
Gene Length: 2397 bp
Protein Length: 798 aa
Translation table: 11
GC content: 34%
IMG OID: 642682134
Product: hypothetical protein
Protein accession: YP_001958095
Protein GI: 189502378
COG category:
COG ID:
TIGRFAM ID:
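
The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch reproduces. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a coding sequence.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that does not contribute an amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Start bp and End bp copied from the record.
print(cds_lengths(1317881, 1320277))
```

Under these assumptions the result agrees with the Gene Length (2397 bp) and Protein Length (798 aa) fields.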


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGATACAAA ATATTTGCAA CTATTCTTAT CATCCTAATG ATTATGTACA TGGGCTACTT 
GCTAGTCATA TATACTATCC TAAACATAAA AAAGGAGATC AAGTAAAACT TCAAAGTATA
TCTAAGCAAT TAGGGCATGA ATTACCGCCA AATCCTCAAA ATATTTGGGA AATAGTCCAA
GTAGAAGATG ATACGGATGG GACAGGCTAT TGTAGTAATT TATATGTAAA TAAAAATATG
CAACAAGCTG TACTATCATT CCAAGGTACT CAGGTTGAAA GCATTGTTAA AATACTAGAT
AATAAAGATT TGAAAGAGGA CTTAGCGAGC ATACTGGCCA ATAAAATTAC AAAGCAGCAA
GCACTTGCTT ATAAAGCAAC AAAAAATGCT GTTAATTATG CTAAGGGAAA AGGACTATCT
TTATCTTTTA CAGGTCATTC CTTGGGTGGT TATTTAGCAG AGTTAGGAGT GGCATTCTGT
TATCTCGATG CTGAACTTGA TTATAGAGAG GTTAAAGCAG TTGTATTTGA TAGTCCAGGA
AGCGGAGAAA AAATAAACCT TCTTAAATCT AATAGTACGG AGTTTGATAT TCAAAAGTTG
CCCATAGTTA CTTACCTTTC AGCACCTAAT ATTGTAAATT CATGTAATGG GCATCCAGGA
GAAATCTGTA TAGTCCATCC AGAGCTAAAG TTAAAAGACT GGGCAATAAA ATATATAGAA
GCGGTAAAAA GTTGGCCCTT GGTAGGTAAA AATATGGTTA GTATTGCCAA GTGTCTATTA
TCACTTACAG GGCATAGTCT AAATACTATC CTTGCATCAT TTGATCCAAA AACAGGTAAG
CCATTTAAAT ATATACGCAT AGGTGATTGG CCAAAGTTCG ACCTAAAAAG GCTAAATCAT
AAATCTTATC TAGGCAATAA AGGAAGAGTA GGTGGGGCTA TAGTGTCAAA GCTTGCCCGG
ATCATAAATA TTCCTATGGG TGGATTTATA GGCTTCCAAG CTGGAGCTTT TATAGAAAAT
AAAATAGATG AACGCCTATG TCTTTTAAGT AGTATAATAG GATTTTTATT AGACTATAAA
AAGATAGATG GAAAGCAGCT CTTGCAAACA TTAGAAGAAT TAGATGAAAA TTATAACAAG
CCTGAAGATG AAACAGCTGA GAATGCTTTT AGACTCAAGT ATATAGGACA CTATAAAGAA
AGCGGGTTAA AACTTAATCA ATATAAAGTA CATAAACATA AGAATAAAAG TGTTGATTGG
TACCTGTATA AGTTAAGGAA GTATGCTAGA GATAAAAGTG TAATAGATAG GTTGAGTAAC
GGAGATTTTA CCATACGGGT ATTACAAAAT ATTTTAAAAG ATTATGATAT TGTAACCATA
TCTGAAAGCC AGTATATCGA ACTTAATACA GAACAAGGAG ATATAGAGGT GTTACGAGCT
AAAATGCGTA GGAATTTAGA AATACTGACA GCTAAAGAGA TTGAGCATGC TATGCACAGT
ATACGTACAC TCACAGCAAA AAGGTTATCA GTTCGACGTA AAGCTGATTC TTTTGCTCGT
GTGAAGAACG GCCATGAGCA AGCAAAGCCT ACTCATGAAA TGAATAAAAT TGTAAATAAG
TCTTTAAAAA GGCAAAACTT TTGTGATAGA CAAATAGACT ACCCACATAA GGAAGAAAAA
CAATACGATT TAAAAAAAAT TATAATTCTA TCTATTTTTA TATTCTTCTT ATTATCATTA
CCTATATTAG CTTATATATT TCACCTCACT CTTAGGCAGA ATCACGTAGA AAAATCTGAA
AATGAATCTA TTATAGAACA TTGGCAAAAT CACAATTTAA CAGAAAACGT GCAAAATAAT
TATAATGTCT TACAGGAAAT AGGAGGAAGT ACAGTTAATT CTAGTTATAA TAATGAGGAC
GCTGAATGGG CAGCATCTAT TGTTCAAGAT ATAAGAGCAA AGCAACTAGA CACCGTTTAT
TTGCATATTG CTGCAGATTT AACTCCTAAA AGAGCGGCGG TTCTTGGTAG AAATTTACAA
GGAACACAAG TGCATACAGT TCGGTTAAAC CATATTGTAA ATGGAGATAA CATAATAACG
GCTCTTGCTA AAAATCTGGA AGGAACGCAA GTACACACAA TTGTTATAGT ATCCAGTGAC
ATAGGTTTTG GTTATATAAA TGATTTTTTC AATGTGAGAG CAGCAGAATT TGCTCAAAAC
TTGCGAGGAA CTCAAGTACA CACAATTGCT ATAGTATCCA GTGACATAGG CAACAGATGG
GCTATAGAAT TTGTTAAAAA TCTAGAAGGG ACCCAAGTAC ATACGGTTGA TTTCAGTGAT
AGTATTATAA GTGATAAAGA GGAAGGCCGA TATCTAATTG AATGGGTTTT TGACTAG
 
Protein sequence
MIQNICNYSY HPNDYVHGLL ASHIYYPKHK KGDQVKLQSI SKQLGHELPP NPQNIWEIVQ 
VEDDTDGTGY CSNLYVNKNM QQAVLSFQGT QVESIVKILD NKDLKEDLAS ILANKITKQQ
ALAYKATKNA VNYAKGKGLS LSFTGHSLGG YLAELGVAFC YLDAELDYRE VKAVVFDSPG
SGEKINLLKS NSTEFDIQKL PIVTYLSAPN IVNSCNGHPG EICIVHPELK LKDWAIKYIE
AVKSWPLVGK NMVSIAKCLL SLTGHSLNTI LASFDPKTGK PFKYIRIGDW PKFDLKRLNH
KSYLGNKGRV GGAIVSKLAR IINIPMGGFI GFQAGAFIEN KIDERLCLLS SIIGFLLDYK
KIDGKQLLQT LEELDENYNK PEDETAENAF RLKYIGHYKE SGLKLNQYKV HKHKNKSVDW
YLYKLRKYAR DKSVIDRLSN GDFTIRVLQN ILKDYDIVTI SESQYIELNT EQGDIEVLRA
KMRRNLEILT AKEIEHAMHS IRTLTAKRLS VRRKADSFAR VKNGHEQAKP THEMNKIVNK
SLKRQNFCDR QIDYPHKEEK QYDLKKIIIL SIFIFFLLSL PILAYIFHLT LRQNHVEKSE
NESIIEHWQN HNLTENVQNN YNVLQEIGGS TVNSSYNNED AEWAASIVQD IRAKQLDTVY
LHIAADLTPK RAAVLGRNLQ GTQVHTVRLN HIVNGDNIIT ALAKNLEGTQ VHTIVIVSSD
IGFGYINDFF NVRAAEFAQN LRGTQVHTIA IVSSDIGNRW AIEFVKNLEG TQVHTVDFSD
SIISDKEEGR YLIEWVFD
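
The sequences are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage and a basic coding-sequence check. The helper names (`clean_sequence`, `gc_percent`, `looks_like_complete_cds`) are hypothetical, and the stop codon set assumes translation table 11 as listed in the record. Note that the gene begins with TTG while the protein begins with M, which is consistent with TTG acting as an alternative start codon under that table.

```python
def clean_sequence(text: str) -> str:
    """Join a space- and newline-grouped sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Stop codons under translation table 11 (assumed from the record's field).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq: str) -> bool:
    """Check that the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Only the first two blocks of the gene sequence are used here for brevity;
# pass the full sequence text to compare against the GC content field.
first_blocks = clean_sequence("TTGATACAAA ATATTTGCAA\n")
print(len(first_blocks), gc_percent(first_blocks))
```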