Gene Spro_2246 details

Gene Information

Locus tag: Spro_2246
Symbol: araG
ID: 5605100
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 2447698
End bp: 2449233
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 58%
IMG OID: 640937785
Product: L-arabinose transporter ATP-binding protein
Protein accession: YP_001478475
Protein GI: 157370486
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
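
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch in Python, assuming that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the record.
start_bp = 2447698
end_bp = 2449233

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the reported protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1536 0 511
```

Under these assumptions the result agrees with the reported Gene Length (1536 bp) and Protein Length (511 aa).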


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000000548441
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCGCCG CATTGCCGTA TTTGGCGTTT AAAGGCATAG GTAAAACCTT CCCGGGCGTG 
AAGGCGTTGG ATGACATCAG CTTTAGCTGT CAGGCCGGGC AGATCCACGC GCTGATGGGG
GAAAACGGCG CTGGGAAATC CACGCTGCTG AAAATTCTCA GTGGCAATTA CGCGCCAACC
CAGGGGGAAA TTCAGCTGCA GGGACGGTCG GTGCAGTTTG CCAACACCAC CGATGCGTTG
AATGCCGGGG TGGCGATTAT CTATCAGGAA CTGCATCTGG TGCCGGAAAT GACGGTGGCT
GAGAACATTT ATCTTGGCCA ACTGCCGACC AAACGCGGGC TGGTGGACAG AAAACTGCTG
CGCTATGAAT CCCGCTTGCA GCTCCAACAC CTCGGCCTGG ATATTGATCC GGATACGCCG
CTGAAATACC TGTCGATCGG CCAGTGGCAA ATGGTGGAAA TCGCCAAGGC GCTGGCGCGC
AACGCCAAGG TGATCGCCTT TGACGAACCC ACCAGTTCAC TTTCCGCCAG AGAGATTGAG
CAACTGTTCC GGGTGATCCG GGAATTACGT GCCGAAGGGC GGGTGATCCT GTATGTCTCG
CACCGTATGG AAGAAATTTT TGCCCTCAGC GACGCCATTA CCGTGTTTAA AGATGGCCGC
TACGTGCGCA CCTTTGACGA TATGCGGCAG GTGGATAACG CGCAGTTGGT GCAGGCGATG
GTCGGGCGCG ATCTGGGTGA CGTCTATGGC TACCAGCCAC GTGAGCTGGG GCCGGTGCGT
CTGGAGCTCA AGGGGCTGAA AGCACCGGGC GTCAAAACGG CGATCGATCT TAGCGTGCGG
GCCGGAGAGA TCGTCGGGCT GTTCGGCCTG GTGGGAGCCG GGCGCAGTGA ACTGATGAAG
GGGGTTTTTG GTGCCACACG GGTCAGTGCC GGCCAATTGA TGCTGGATGG ACAGGCGATC
GCCATTCGTT CACCGATTGA CGCGATCCGC GCCGGTATCA TGCTGTGCCC GGAGGATCGC
AAGGCGGACG GCATCATCCC GGTGCACTCG GTGCGTGACA ACATCAACAT CAGCGCAAGG
CGCAACAGCC TGCGCGCCGG TTGCCTGATC AACAAAGGGT GGGAGGCCAG CAATGCCGAT
CATCATATTC GTGCATTGAA TATCAAAACG CCTGGCCCTG AGCAGTTGAT TATGAATTTG
TCCGGCGGCA ATCAGCAGAA GGCCATTCTG GGTCGCTGGC TGTCGGAAGA GATGAAAGTG
ATCTTGCTCG ATGAACCAAC ACGCGGCATC GACGTCGGTG CCAAGCATGA AATTTATCAC
GTCATTTACC AACTGGCGCA GCGCGGCATT GCGGTGCTGT TCGCCTCCAG TGACCTGCCA
GAGGTGCTGG GGCTGGCTGA CCGTATCCTG GTGATGCGTG AAGGCGCACT GTCCGGCGAA
TTACGGCATG ACGAGGCCAG TGAGGAAAAA GCCCTCAGCC TGGCGATGCT GCGCACCCCC
GATATAGCCC CAGATGCCGC TGCGGCGGTG GCCTGA
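
The record reports a GC content of 58% for this gene. As a rough sketch of how such a figure could be computed from the gene sequence, the hypothetical helper `gc_content` below counts G and C bases and ignores the spaces and line breaks used in the display. It is an illustration, not the database's own method, and the sample is only the first line of the sequence, so its value need not match the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, used only as a short sample.
sample = "ATGACCGCCG CATTGCCGTA TTTGGCGTTT AAAGGCATAG GTAAAACCTT CCCGGGCGTG"
print(f"{gc_content(sample):.1f}%")
```

Passing the full 1536 bp sequence instead of the sample gives the whole-gene value to compare against the reported figure.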
 
Protein sequence
MTAALPYLAF KGIGKTFPGV KALDDISFSC QAGQIHALMG ENGAGKSTLL KILSGNYAPT 
QGEIQLQGRS VQFANTTDAL NAGVAIIYQE LHLVPEMTVA ENIYLGQLPT KRGLVDRKLL
RYESRLQLQH LGLDIDPDTP LKYLSIGQWQ MVEIAKALAR NAKVIAFDEP TSSLSAREIE
QLFRVIRELR AEGRVILYVS HRMEEIFALS DAITVFKDGR YVRTFDDMRQ VDNAQLVQAM
VGRDLGDVYG YQPRELGPVR LELKGLKAPG VKTAIDLSVR AGEIVGLFGL VGAGRSELMK
GVFGATRVSA GQLMLDGQAI AIRSPIDAIR AGIMLCPEDR KADGIIPVHS VRDNINISAR
RNSLRAGCLI NKGWEASNAD HHIRALNIKT PGPEQLIMNL SGGNQQKAIL GRWLSEEMKV
ILLDEPTRGI DVGAKHEIYH VIYQLAQRGI AVLFASSDLP EVLGLADRIL VMREGALSGE
LRHDEASEEK ALSLAMLRTP DIAPDAAAAV A
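
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below, in Python, translates the first and last lines of the gene sequence and compares them with the start and end of the protein sequence. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in the start codons it permits). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence from this record.
first_line = "ATGACCGCCG CATTGCCGTA TTTGGCGTTT AAAGGCATAG GTAAAACCTT CCCGGGCGTG"
last_line = "GATATAGCCC CAGATGCCGC TGCGGCGGTG GCCTGA"

print(translate(first_line))  # prints: MTAALPYLAFKGIGKTFPGV
print(translate(last_line))   # prints: DIAPDAAAAVA
```

Both results match the corresponding ends of the protein sequence, and the final codon TGA is a stop codon.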