Gene Rru_A3074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A3074 
Symbol 
ID3836520 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp3539235 
End bp3540776 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content71% 
IMG OID637827189 
Productglycosyltransferase 
Protein accessionYP_428156 
Protein GI83594404 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0020891 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGCTC CCTGGATCCA ATGCGCCACG GGAACGGCCG ATCGCGGCCG GGCGCTGTTG 
TATTCCCGCT GGGTCGGGCT GGCGGTGCTC TGCCTGCTGT GCCTCGTCCG CCTGTTGCTG
AGCGCCGGCG GCACCATCCC CCCCCATCCC GACGAAGCGA TGCTCTGGGC CGCCGGTCAG
CCGTTGGGGG CCGTCCTGGG CGGCGACCAC CCTTTGGTTT CGCTGATCCT GTCGGCCAGC
GTCGAGCTTC TGGGCAATAC GGTTTTCGCC CTGCGCGCCC CCGGGGTGAT CGCCCTTGGC
CTTGCCAGCA TCGCCGTCTG GCGGCTGGCC GGGCTGCTCT ATGACGCCCG GGTGGCTTTT
TGGGCCGCCG TGGTCTTCGC CACCCTGCCG GTGGTCTCCT ATGCCTCGGC CATCGCCGGA
ACGGCGGGGT TCCTGCCGCT GTTCTGGGCG GTGGCCCTGC ATGGGCTGCT GCGCGGCCTG
AAGAGCGATT CCCTGGTTTG GTGGCTTGTT TTGGGCACGG CCTTCGGGCT TGGCCTGTTG
ACCGACGGGG CGATGGCGCT TTTGGTGCCG TGCTTCCTGC TCTATGGCCT GCTGTCGCCC
GAATACCATG CCCTGTGGCG GCGCCGGGGC CTGTGGCTGG CTTTGGGTCT GGGCTTGGCG
ATCGCCGCCC CCGCCTTTTG GGCCGGTCTT CTGCACGCCG ATCTCACCCC CCAGCCGACG
CCGGCGGCGG GCTTCGCCTT CCTGGTGGCC CAGGTCGCGG TTTTCGGGCC AATTCCGGCC
ACCGTGCTGG CCTGGGTGGC GCTGCATCCC GGCGGCGGGG CGATGATCGG CGGATACGGG
CCCGGTGACG AGCGCCGCGC CGCCGACGAG CGGGCGCGGC GCGGCTATCG CATCCGGTTC
TTTCTGTCGT TCAGCCTGCC GGTGGTGGCC CTGGCCACCC TCGCCGCGGC GATGGGCGGC
GCCCTGCCGG CGGAGGCGGC GGCGGTGGCC TATGTCGGCG GCGCCATCCT GGTGGCCTCG
TGGCTGCTGA CGACGCCGTT GCGTCGCGGC CTGCTGCGGC TTTGCGTGGC GCTGCATATC
CTGGGCGCCC TGCTGTTCTT CAATCTGGAT GGCCTGCTGC GCGACAGCGG CCTGCGCCCG
CCCGCGGGGC TTGACCCCTT CGCCGATCTG CGCGGCATGG ACCGGGTGGC GGTTTGGGGC
GGTGAACTGG CGGCGCGCTA TCCCGGGGTG CCGATGATCT TCGATGACCC GGGCATTCTG
GCCAGCCTGC GCTTCCAGAG CCATCCCCGC TCCACCGTCA TGGTGCTGGC CAGCGCTCTG
GGCGGAGCGG ATGCCATCGG TTTGGGACCG GCGCCCAACG GCATTCTGGT GATCACCCGC
GCCCCGACCG GGCCCGATAC GCCCACTCCC GATACGCCCG CCGCCGATAC TTCCAATGAC
GAGGGCGGGC GCGATGCCGG CTTCGTTGAT ATCGAAGCCG TTCCCGGGCG GTGGATATCG
CTGCGCGCGC GCTTTCTGCC GCCCGCCGGA GAGCAACCAT GA
 
Protein sequence
MDAPWIQCAT GTADRGRALL YSRWVGLAVL CLLCLVRLLL SAGGTIPPHP DEAMLWAAGQ 
PLGAVLGGDH PLVSLILSAS VELLGNTVFA LRAPGVIALG LASIAVWRLA GLLYDARVAF
WAAVVFATLP VVSYASAIAG TAGFLPLFWA VALHGLLRGL KSDSLVWWLV LGTAFGLGLL
TDGAMALLVP CFLLYGLLSP EYHALWRRRG LWLALGLGLA IAAPAFWAGL LHADLTPQPT
PAAGFAFLVA QVAVFGPIPA TVLAWVALHP GGGAMIGGYG PGDERRAADE RARRGYRIRF
FLSFSLPVVA LATLAAAMGG ALPAEAAAVA YVGGAILVAS WLLTTPLRRG LLRLCVALHI
LGALLFFNLD GLLRDSGLRP PAGLDPFADL RGMDRVAVWG GELAARYPGV PMIFDDPGIL
ASLRFQSHPR STVMVLASAL GGADAIGLGP APNGILVITR APTGPDTPTP DTPAADTSND
EGGRDAGFVD IEAVPGRWIS LRARFLPPAG EQP