Gene YpAngola_A1037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A1037 
SymbolaroP2 
ID5799500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp1061631 
End bp1063028 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content48% 
IMG OID641339025 
Productaromatic amino acid transporter 
Protein accessionYP_001605597 
Protein GI162420665 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.725191 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.852119 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGATC AACAAGAGGG TGCTGAGCTA AAACGGGGGC TTAAAAACCG CCATATTCAG 
CTTATTGCCC TAGGTGGTGC AATTGGTACC GGACTATTCC TCGGCATAGC ACAGACCATC
AAAATGGCTG GGCCTTCGGT TTTACTGGGG TACGCAATTG GGGGTTTTAT TGCGTTTCTG
ATAATGCGCC AGCTAGGCGA AATGGTGGTT GAAGAACCTG TAGCCGGTTC CTTTAGCCAC
TTTGCGTATA AATATTGGGG ACACTTTGCC GGTTTTGCTT CTGGCTGGAA CTACTGGGTG
CTGTATGTGT TGGTGGCGAT GGCCGAACTA ACCGCAGTGG GGATCTATGT GCAATATTGG
TGGCCAGAAA TCCCTACCTG GGTCTCCGCC GCCGTCTTCT TCTTGGCCAT CAACGCCATC
AACCTGGCTA ACGTAAAAGT CTATGGTGAG ATGGAATTTT GGTTTGCCAT CATTAAAGTG
ATCGCGATTA TTGCGATGAT TTTATTTGGC GGTTACCTGC TCATCAGTGG CCGGGGTGGC
CCAGAAGCCA CGGTAACCAA CTTATGGGCC CAAGGCGGTT TCTTCCCGAA TGGCATCATG
GGTCTGGTGA TGGCAATGGC GGTAATTATG TTCTCTTTCG GTGGCCTTGA ATTAGTGGGT
ATCACCGCAG CAGAAGCAGA AGACCCGGCC AAAAGCATTC CGAAGGCAAC CAATCAGGTT
ATCTACCGTA TTCTTCTGTT TTATATTGGT TCTCTGGCAA TCTTGTTATC ACTCTACCCA
TGGGGAAAAG TGGTCGAAGG CGGCAGCCCA TTCGTATTGA TTTTCGATGC GCTGGACAGT
AATTCAGTCG CCACTGTCTT GAATATTGTC GTACTGACGG CGGCACTCTC GGTCTACAAC
AGTTGCGTAT ACTGTAACAG CCGCATGTTG TTTGGTTTAG CTAAACAAGG TAATGGCCCG
AAAATCCTGT TGAAAGTGGA TGGCCGAGGT GTTCCAGTCA TTGCGATTGC TGTTTCTGCG
TTTGCTACCG CGTTTTGTGT ACTGATTAAC TACCTGTTAC CTGGCCGTGC CTTTGAATTA
CTGATGGCAT TAGTGGTATC CGCGTTGGTG ATCAACTGGG CGATGATTAG CCTGGCACAC
CTGAAATTCC GTGCGGCGAA AAACCGCCAG GGCGTAATAC CAAAATTCAA AGCATTTTGG
TATCCGTTCG GTAACTGTTT GTGTTTGTTG TTCCTGACCG GCATCTTAGT GATCATGTTT
CTGACACCCA GCATCCGGAT TTCAGTGATA CTTATTCCTG TCTGGGTAGT CGTCCTAGCG
ATTGGTTATA TTCTGAAGAA TCAGAGCCAA CGTCAGAATC AGCAACTGAG CGCTAGCAGC
AGGAAAGTAA CCAAGTAA
 
Protein sequence
MSDQQEGAEL KRGLKNRHIQ LIALGGAIGT GLFLGIAQTI KMAGPSVLLG YAIGGFIAFL 
IMRQLGEMVV EEPVAGSFSH FAYKYWGHFA GFASGWNYWV LYVLVAMAEL TAVGIYVQYW
WPEIPTWVSA AVFFLAINAI NLANVKVYGE MEFWFAIIKV IAIIAMILFG GYLLISGRGG
PEATVTNLWA QGGFFPNGIM GLVMAMAVIM FSFGGLELVG ITAAEAEDPA KSIPKATNQV
IYRILLFYIG SLAILLSLYP WGKVVEGGSP FVLIFDALDS NSVATVLNIV VLTAALSVYN
SCVYCNSRML FGLAKQGNGP KILLKVDGRG VPVIAIAVSA FATAFCVLIN YLLPGRAFEL
LMALVVSALV INWAMISLAH LKFRAAKNRQ GVIPKFKAFW YPFGNCLCLL FLTGILVIMF
LTPSIRISVI LIPVWVVVLA IGYILKNQSQ RQNQQLSASS RKVTK