Gene Mlg_2701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2701 
Symbol 
ID4269945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp3063606 
End bp3065441 
Gene Length1836 bp 
Protein Length611 aa 
Translation table11 
GC content64% 
IMG OID638127462 
ProductNa+/Pi-cotransporter 
Protein accessionYP_743531 
Protein GI114321848 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1283] Na+/phosphate symporter 
TIGRFAM ID[TIGR00704] Na/Pi-cotransporter 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTCC GCGCGGTGGT GCTCGCCATT GTCCTGGGCA TCCTGGCCTG GGGCTTCTGG 
CAGAGTGGAG ATTTCACCGA GATCGCCGCC GGCGTGGCCA TCTTCCTGTT CGGCATGATG
TCGCTGGAGC AGGGTTTCCG CACCTTTACC GGCGGCACCC TGGAGACCCT GCTCGAGGCT
TCCACCAATC GGTTGTGGAA GAGCGTGGGC TTCGGCATCG CCAGCACCAC GCTGATGCAG
TCCAGCACCC TGGTCTCGCT GGTCACCATC TCCTTCGTCA GCGCCCAGAT GATCCCGCTG
GCCGCGGGCA TCGGCGTGGT GCTGGGGACC AACCTGGGGA CCACCACGGG TGCCTGGCTG
ATCGCCGGCC TGGGCCTGCG GGTGAACATC TCCGCCTACG CCATGCCGTT ACTGGTCTTC
GCCATCGTGC TGATGTTCCA GCGCGGTAAG ATGGCCAAGG GGGCGGGCAA TATCCTGCTG
GGCATCGGCT TTCTCTTCCT GGGTATCCAC TACATGAAGG AGGGCTTCGA CGCCTTCCAG
GAGACCTTCG ACCTGGCCGC CTACTCCATG GAGGGGATGG CCGGGCTGCT GGTCTACATC
GGCATCGGCA TGCTCATCAC GGTGATCATG CAGTCCAGCC ACGCCACCTT GCTGGTGGTG
ATCACCGCCC TGGCGGCCGG GCAGGTCACC TACGAGAACG GCCTGGCCCT GGCCATCGGC
GCCAACCTGG GGACGGCGGT GACCACGGCC CTGGGTGGCA TGACCGCGCA CCTGGGCGGC
AAGCGGCTGG CGGTGGCGCA CGTGGTCTTT AATATCGTGA CCGCCGTGGT GGCGGTGGCG
TTCATGGACT GGATCCGTTT GGGTGTGGAT TTCGGCGGCA ACCTGCTGGG CTTTGCCGAG
GACGACTTTC TGCTTCGCCT GGCGCTGTTC CATACCCTGT TCAATTTGTT GGGCGTGATG
ATCTTCGCGC CCTTCACCAA GCAGTTTGCC AGCCTGCTGG AGCACTATGT GACATTCGTC
TCCAAGCGCA CGGTCAGGCC GCAGTTCCTG CACAAGGACG CGCTGAAGGT GCCGGAGGTC
ACCGTGGCCG CAGTGCGCAA GGAGGTCTGG CACCTGTACG AGAATGCCTT TTCGCTGATC
ACCCACGGGC TCAGTCTGCG GCGCACGGTG GTTCGCTCCG AGCAGTCGTT GAGCGACGCT
GTGGCCCGTA CCCAGCGCAT CATGCCGCTG GATATCGATG ACGATTACGA GCAGCGGATC
AAGAGCCTGC AGAGCGCCAT CGTGGAATTC ATCAGCGAGA GCGGGACCAG TGGTGATACT
CCGGCCGCCG CCACCGAGCA GCTCTACGAA CTGCGCCACG CCAGCCAGAA TATTGTGCTG
GCGGTGAAGG ACATGAAGCA CTTGCACAAG AACCTGTCGC GGCTTGGCCT GTCCCGTAAC
CGCGCCATCC GCGAGCGCTA CGACGAGATC CGGCTGCTGA TTGCAGGGCT GTTGCGCGAG
ATTGAGCAAC TGCGCCAGGA GGAACCCGGA GCCTCCACCG TACTCGCGCT TGATGCCTAC
AAGGTCAGCG TGGAGCGCTT CTACCGGGGG TTCAGCGCCC GGCTGGAGGA GGCGATCAGG
GAGCGGCGCA TGCGCGGTGC CGAGGCCACC TCGCTGATGA ATGATGCGGG GTACGCCTAT
GATATTGCCC GTCTGCTTAT CGAGGCGGCG CAGATCTTGC TGGTGGCCAA GGAAAAGGAG
GTGCGCCTGG CGCAGAGTCA GGTCGCGCTC AGCGATGAAG AGATTCAGGC GGCTGTCGAG
GAGGTCTCCT CTGAGAGCAA GGGGCGGACG TTGTGA
 
Protein sequence
MSLRAVVLAI VLGILAWGFW QSGDFTEIAA GVAIFLFGMM SLEQGFRTFT GGTLETLLEA 
STNRLWKSVG FGIASTTLMQ SSTLVSLVTI SFVSAQMIPL AAGIGVVLGT NLGTTTGAWL
IAGLGLRVNI SAYAMPLLVF AIVLMFQRGK MAKGAGNILL GIGFLFLGIH YMKEGFDAFQ
ETFDLAAYSM EGMAGLLVYI GIGMLITVIM QSSHATLLVV ITALAAGQVT YENGLALAIG
ANLGTAVTTA LGGMTAHLGG KRLAVAHVVF NIVTAVVAVA FMDWIRLGVD FGGNLLGFAE
DDFLLRLALF HTLFNLLGVM IFAPFTKQFA SLLEHYVTFV SKRTVRPQFL HKDALKVPEV
TVAAVRKEVW HLYENAFSLI THGLSLRRTV VRSEQSLSDA VARTQRIMPL DIDDDYEQRI
KSLQSAIVEF ISESGTSGDT PAAATEQLYE LRHASQNIVL AVKDMKHLHK NLSRLGLSRN
RAIRERYDEI RLLIAGLLRE IEQLRQEEPG ASTVLALDAY KVSVERFYRG FSARLEEAIR
ERRMRGAEAT SLMNDAGYAY DIARLLIEAA QILLVAKEKE VRLAQSQVAL SDEEIQAAVE
EVSSESKGRT L