Gene Arth_1137 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1137 
Symbol 
ID4446370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1232662 
End bp1233732 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content63% 
IMG OID639688943 
Productbasic membrane lipoprotein 
Protein accessionYP_830631 
Protein GI116669698 
COG category[R] General function prediction only 
COG ID[COG1744] Uncharacterized ABC-type transport system, periplasmic component/surface lipoprotein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGGCG TTGCCACCGC CGGGGCGGCC GCTCTCCTGC TGACCAGCTG CGGGGCTGCC 
CCGGAAGCGG GCAACACCGC TAGCGCCACC GCCAGCGACT ACACGGGCTG CATCGTGTCC
GACTCGGGTG GATTCGACGA CCAGTCGTTC AACCAGTCCT CCTACGAAGG CCTGAAGAAG
GCTGAGAAGG ATCTCGGGAT CAAGGTCAAC CAGGTCGAGT CCAAGACCAA CAACGACTTC
GAGCCGAACC TCCGCGCCAT GGTCACTGCA GGCTGCGACC TGACCGTCAC GGTCGGCTTC
CTCCTCGGCG ACGCCACCAA GGCCCAGGCC ACCGCCAACC CGGACAAGCA CTTCGCCATC
ATCGACTTCG GCTACGACAC CCCCATCACC AACGTCAAGC CGATCATCTA CGACACCGCC
CAGGCTGCCT TCCTGGCCGG TTACCTCGCG GCAGGCTCCA CCAAGACCGG AACGGTGGCG
ACCTTCGGCG GCATCAAGAT CCCCACTGTC ACCATCTTCA TGGACGGCTA CGCCGACGGC
GTGAAGTACT ACAACGAACA GAAGGGCAAG GACGTCAAGA TCCTTGGCTG GGACAAGGCG
AAGCAGGACG GCAGCTTCAC GGGCGACTTC GAAAAGCAGG ACAAGGGCAA GCAGCTGACC
CAGAACTTCC TGGACCAGGG CGCAGACATC GTGATGCCCG TTGCCGGCCC CGTCGGCAAG
GGCGCAGGCG CAGCACTCAA GGAAGCCAAG GCCGCAGGCA AGGACGTCAA ACTCATCTGG
GTTGACTCGG ACGGCTTCCT CACCGCCCCT GACTACAAGG ACATCATGCT CTCCTCCGTC
ATGAAGCAGA TGGGCGAAGC AGTGGAGACC GTCGTGAAGG AAGACAAGGA CGGCAAGTTT
AGCAACACGC CGTACGTCGG CACCCTCGCG AACGACGGCG TGCAGCTGGC TCCGTTCCAC
GATCTGGAGT CCCAGGTTCC CGCGGAACTG AAGACCGAAC TGGAACAGAT CAAGAAGGAC
ATCGTCGACG GCAAGCTGAA GGTCGAGTCG GCAGCGAGCC CGAAGGCCTA G
 
Protein sequence
MTGVATAGAA ALLLTSCGAA PEAGNTASAT ASDYTGCIVS DSGGFDDQSF NQSSYEGLKK 
AEKDLGIKVN QVESKTNNDF EPNLRAMVTA GCDLTVTVGF LLGDATKAQA TANPDKHFAI
IDFGYDTPIT NVKPIIYDTA QAAFLAGYLA AGSTKTGTVA TFGGIKIPTV TIFMDGYADG
VKYYNEQKGK DVKILGWDKA KQDGSFTGDF EKQDKGKQLT QNFLDQGADI VMPVAGPVGK
GAGAALKEAK AAGKDVKLIW VDSDGFLTAP DYKDIMLSSV MKQMGEAVET VVKEDKDGKF
SNTPYVGTLA NDGVQLAPFH DLESQVPAEL KTELEQIKKD IVDGKLKVES AASPKA