Gene Apar_0392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0392 
Symbol 
ID8413241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp453323 
End bp454378 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content45% 
IMG OID645021960 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_003179414 
Protein GI257784197 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGCAT CAAATTGCAA GAGTATGACA AGAAAAGGCT TTCTAGCTGC GATGGGTTTC 
TCGACAGCAG GATTTTGCGC AGGTTGCTCG CTTGAACAAC CATCTGCTCC GTCAAACGGT
GGTAACAATG GCACGGGCCA AAATACTGGT GGTACTACAG AGATTACGTT TGCCCTGGAC
TATACGCCAA ATACCAATCA CACCGGCATT TATGTAGCTC AGGAAAAGGG CTACTTTGAC
GAAGTAGGCC TCAAGGTAAC CATTCAGCAG CCACCCGCTG ACGGTGCTGA TGCGTTGATT
GGTGCTGGTG GTGCCCAGAT GGGTGTTACC TATCAGGACT ATATTGCTAA TAGTCTCTCG
TCATCTAATC CACTACCGTA TACTGCGGTT GCCGCTATCA TTCAGCACAA TCTTTCGGGC
ATTATGAGCC GTGAAGATGA TCATATTGTT CGTCCGCGTG ACCTTAATAA CCATACTTAC
GCAACGTGGA ATCTTCCTGT TGAACAGGCT ACTATCAGGT CTGTTATAGA GTCTGATGGC
GGAGATCCTT CAACGCTTAA GATGGTACCT TATGAGGTAG ATGATGAGGT ATCTGGCTTA
AAAGCAAAGA TGTTTGATGC TGTTTGGGTG TATGAGCAGT GGGCTGTTCA AAACGCTCGT
GTTCAGAATT TTGCATACAA TTATTTTGCC TTTTCAGCTA TTGATCAAAA CTTTGATTAC
TACACGCCAG TCATTGCGGC AAACGATGAC TTTGCAAAAA AGAACCCAGA TGCTGTTAAA
GCATTTTTGA GCGCTACCAG AAAGGGTTAT GAGTTCTGTG TTTCTAATCC TGACGAGGCA
GCAGAGATTC TGCTCAAAGC TGTTCCAGAA CTTGATGCTG ATCTGGTCAA AGCCTCTCAG
AAGTTCCTAG CCTCTAAGTA TATTGACGAC GCTGAGAAGT GGGGCGTTAT TGATTCAGCT
CGCTGGCAGC GTTTTTATAA CTGGCTTAAC AACCAAAAGC TACTAGAGAA TAAGATTGAT
CCATCCGCAG GATTTACTAG TGAGTACCTT GGATAA
 
Protein sequence
MNASNCKSMT RKGFLAAMGF STAGFCAGCS LEQPSAPSNG GNNGTGQNTG GTTEITFALD 
YTPNTNHTGI YVAQEKGYFD EVGLKVTIQQ PPADGADALI GAGGAQMGVT YQDYIANSLS
SSNPLPYTAV AAIIQHNLSG IMSREDDHIV RPRDLNNHTY ATWNLPVEQA TIRSVIESDG
GDPSTLKMVP YEVDDEVSGL KAKMFDAVWV YEQWAVQNAR VQNFAYNYFA FSAIDQNFDY
YTPVIAANDD FAKKNPDAVK AFLSATRKGY EFCVSNPDEA AEILLKAVPE LDADLVKASQ
KFLASKYIDD AEKWGVIDSA RWQRFYNWLN NQKLLENKID PSAGFTSEYL G