Gene Arth_1378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1378 
Symbol 
ID4446119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1535362 
End bp1536978 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content65% 
IMG OID639689189 
ProductSSS family solute/sodium (Na+) symporter 
Protein accessionYP_830872 
Protein GI116669939 
COG category[R] General function prediction only 
COG ID[COG4147] Predicted symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTTA TGGTTCCCGC GGTGAACGTT GCGGATCTCA AGGACACCAC CCTGCTGAAC 
ATGGGCATCT TTGGCCTGTT TGTGGCCATC ACCATGGTCA TCGTGATCAA GGCAAGCCGC
AACAACAAGA CGGCGGCGGA CTACTACGCC GCCGGACGTT CCTTCACCGG TCCGCAGAAC
GGCACCGCCA TTGCCGGAGA CTACCTCTCC GCGGCGTCGT TCCTCGGGAT CACCGGCGCC
ATCGCCGTCA ACGGCTACGA CGGCTTTATG TACTCCATCG GCTTCCTGGT CGCCTGGCTC
GTCGCCCTGC TGCTCGTGGC CGAGTTGCTC CGCAACACCG GCAAGTTCAC CATGGCCGAT
GTGCTCTCCT TCCGGCTCAA GCAGCGCCCG GTGCGCATCG CGGCCGCCAT CTCCACCCTG
GCGGTCTGCT TCTTCTACCT CCTGGCGCAG ATGGCCGGGG CGGGCAGCCT GATCTCCCTG
CTCCTCGGCA TCAGCGACTG GGGCGGACAG GCCCTGGTGA TCATCGTCGT CGGCGCCCTC
ATGATCATGT ACGTACTGAT CGGCGGCATG AAGGGCACCA CCTGGGTGCA GATCATCAAG
GCCATGCTGC TGATTGCCGG CGCCGCCGTG ATGACCTTCT GGGTCCTCGC CATCTACGGC
TTCAACCTCT CCGACCTGCT GGGCGGAGCG GTGGAAACGT CAGGCAACCC GAACATGCTC
AACCCGGGCC TGCAGTACGG CAAGACCGAA ACCTCCAAGC TGGACTTCAT GTCCCTGGGC
CTGGCGCTGG TGCTCGGCAC CGCCGCCCTG CCCCACGTGC TGATGCGCTT CTACACTGTT
CCCACCGCCA AGGAAGCCCG CAAGTCCGTG GTCTGGTCCA TCTGGCTGAT CGGCCTGTTC
TACCTGTTCA CCCTGGTCCT GGGCTACGGT GCCGCGGCAC TGGTCGGCGC TGACACCATC
AAGGGTGCCC CGGGCGGAGT CAACTCGGCC GCACCGCTGC TGGCGTTCCA CCTTGGCGGC
CCGCTGCTGC TGGGCTTCAT CTCTGCGGTG GCCTTCGCCA CCATCCTGGC GGTTGTCGCC
GGGCTCACCA TCACCGCGGC GGCATCGTTT GCGCACGACA TCTACGCCAG CGTGATTGCC
AAGGGGAAGG CCGACGCCGA CACGGAGGTC AAGGTCGCCC GGCGCACCGT GGTGGTCATC
GGCCTCCTGG CTATCGCCGG CGGCATCTTC GCCAACGGCC AGAACGTGGC GTTCCTCGTG
GCACTCGCCT TCGCTGTGGC GGCCTCGGCG AACCTGCCCA CCATCGTGTA CTCGCTTTTC
TGGCGGAAGT TCACCACCCA GGGTGCCATC TGGAGCATGT ACGGCGGACT TGGCTCAGCG
ATCATCCTGA TCGCACTGTC GCCTGTTGTC TCGGGGGCCA AGACGTCCAT GATCCCGGGC
GCCAACTTCG CGATCTTCCC GCTCAGCAAC CCCGGCATCG TCTCGATCCC GCTCGCATTC
TTGCTGGGCT GGCTAGGGTC GGTGCTGGAC AAGAAGCTGG AAGACACCAC CAAGCAGGCC
GAAATGGAAG TCCGCTCCCT GACCGGCGTG GGCGCAGAGA AGGCAGTCGA CCACTGA
 
Protein sequence
MTLMVPAVNV ADLKDTTLLN MGIFGLFVAI TMVIVIKASR NNKTAADYYA AGRSFTGPQN 
GTAIAGDYLS AASFLGITGA IAVNGYDGFM YSIGFLVAWL VALLLVAELL RNTGKFTMAD
VLSFRLKQRP VRIAAAISTL AVCFFYLLAQ MAGAGSLISL LLGISDWGGQ ALVIIVVGAL
MIMYVLIGGM KGTTWVQIIK AMLLIAGAAV MTFWVLAIYG FNLSDLLGGA VETSGNPNML
NPGLQYGKTE TSKLDFMSLG LALVLGTAAL PHVLMRFYTV PTAKEARKSV VWSIWLIGLF
YLFTLVLGYG AAALVGADTI KGAPGGVNSA APLLAFHLGG PLLLGFISAV AFATILAVVA
GLTITAAASF AHDIYASVIA KGKADADTEV KVARRTVVVI GLLAIAGGIF ANGQNVAFLV
ALAFAVAASA NLPTIVYSLF WRKFTTQGAI WSMYGGLGSA IILIALSPVV SGAKTSMIPG
ANFAIFPLSN PGIVSIPLAF LLGWLGSVLD KKLEDTTKQA EMEVRSLTGV GAEKAVDH