Gene Rfer_3607 details

Gene Information

Locus tag: Rfer_3607
Symbol:
ID: 3962622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodoferax ferrireducens T118
Kingdom: Bacteria
Replicon accession: NC_007908
Strand:
Start bp: 4018315
End bp: 4019826
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 62%
IMG OID: 637918426
Product: anthranilate synthase component I
Protein accession: YP_524841
Protein GI: 89902370
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0147] Anthranilate/para-aminobenzoate synthases component I
TIGRFAM ID: [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages
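
The length fields in this record agree with its coordinates if the start and end positions are read as 1-based and inclusive, and if the coding sequence ends in a single stop codon that is not counted in the protein length. Both readings are assumptions, not statements from the record. A minimal Python sketch of that arithmetic, with hypothetical function names:

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(4018315, 4019826)
    print(length, protein_length(length))
```

With the values listed above, gene_length(4018315, 4019826) gives 1512 and protein_length(1512) gives 503, matching the Gene Length and Protein Length fields.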


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCTCTG AACTCGAATT CAAAAGCCTG AGCGCCCAAG GCTACAACCG CATTCCCCTG 
ATGGCAGAAG CCTTTGCCGA TCTGGAAACC CCCTTGTCGC TGTACCTGAA GCTGGCGTAT
TCCAAAGACA GCGGCAAATA CAGCTTTCTG CTGGAGTCCG TCGTCGGCGG CGAGCGCTTC
GGCCGCTACA GCTTCATTGG CCTGCCGGCC CGCACGCTCT TACGCTCCAT GGGTTTTGGC
CCGGAGGTGT GTACCGAAGT GGTGACCGAC GGTGTCGTGG TGGAAACCTC GCGCGCCAAC
CCGCTGGATT TCATCTCGGA CTACCAGAAG CGCTTCAAAG TCGCGCTGCG CCCCGGTCTG
CCGCGTTTTT GCGGTGGCCT GGCCGGCTAC TTTGGCTACG ACGCCGTGCG CTACATCGAG
AAAAAGCTCG AAAAATCGTG CCCGCCCGAC ACCCTTGGCT GCCCCGACAT CCTGCTGTTG
CAGTGCGAAG AGCTGGCGGT GATCGACAAC CTGTCGGGCA AGCTGTACCT GATTGTCTAT
GCCGACCCGG CCCGCCCCGA GGCTTATGCC AACGCCAAGA AGCGCCTGCG CGAGTTGAAG
GAGCAGCTGA AATACTCGGT CAGCGCGCCG GTGGTCAAGC CGACGCAAAG CCACAGCGCC
GAGCGCGACT TTGCCAAGGC CGACTACCTT GCCGCCGTGG AGCGCGCCAA GGAGTTGATT
GCCGGCGGCG ACTTCATGCA GGTGCAGGTG GGCCAGCGCA TCAAGAAGCG CTACACCGAG
TCGCCGCTGA GCTTGTACCG GGCCTTGCGC GCGCTTAATC CCAGCCCGTA CATGTATTAC
TACCATTTCG GCGACTTCCA TGTGGTGGGC GCCTCACCCG AGATTCTGGT GCGCCAGGAG
CAGGTGACTG TTGACGGCAA GACCGAGCAG AAGGTCACCA TCCGGCCACT GGCCGGCACG
CGCCCGCGTG GCGCCAGCCT GGAGCTGGAC AAGGCGGCCG AGGTGGAGCT GATCAACGAC
CCGAAAGAGC GCGCCGAGCA TGTGATGCTG ATCGACCTGG CGCGCAACGA CATCGGCCGC
ATCGCCAAAA TCGGCAGCGT GAAAGTGACC GAAGCCTTCA GCGTCGAGCG CTACAGCCAT
GTGATGCACA TCGTCAGCAA TGTCGAAGGC ACCCTGAACG ATGGCATGAC CAGCATGGAC
GTGCTCAAAG CCACCTTTCC GGCCGGCACC CTGACCGGTG CGCCCAAGGT GCATGCCATG
GAGCTGATCG ACCAGTTGGA GCCGACCAAG CGCGGGGTTT ATGGCGGCGC CTGCGGTTAC
CTGAGTTACG CCGGCGACAT GGACGTGGCG ATTGCGATTC GCACCGGCAT CATCAAGGAC
CAGACACTCT ATGTTCAGGC GGCGGCCGGC ATCGTGGCCG ATTCGGTGCC GGAGCTGGAA
TGGAAGGAGA CCGAGGCCAA AGCGCGGGCA TTGCTGCGGG CGGCAGAGCT GGTGGAGGAA
GGGCTGGAGT GA
 
Protein sequence
MISELEFKSL SAQGYNRIPL MAEAFADLET PLSLYLKLAY SKDSGKYSFL LESVVGGERF 
GRYSFIGLPA RTLLRSMGFG PEVCTEVVTD GVVVETSRAN PLDFISDYQK RFKVALRPGL
PRFCGGLAGY FGYDAVRYIE KKLEKSCPPD TLGCPDILLL QCEELAVIDN LSGKLYLIVY
ADPARPEAYA NAKKRLRELK EQLKYSVSAP VVKPTQSHSA ERDFAKADYL AAVERAKELI
AGGDFMQVQV GQRIKKRYTE SPLSLYRALR ALNPSPYMYY YHFGDFHVVG ASPEILVRQE
QVTVDGKTEQ KVTIRPLAGT RPRGASLELD KAAEVELIND PKERAEHVML IDLARNDIGR
IAKIGSVKVT EAFSVERYSH VMHIVSNVEG TLNDGMTSMD VLKATFPAGT LTGAPKVHAM
ELIDQLEPTK RGVYGGACGY LSYAGDMDVA IAIRTGIIKD QTLYVQAAAG IVADSVPELE
WKETEAKARA LLRAAELVEE GLE
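
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is illustrative and not part of the record: the function names are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG). Whitespace is stripped first because the sequences are printed in blocks of ten.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in TCAG order, first base varying slowest; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGATCTCTG AACTCGAATT CAAAAGCCTG"))  # MISELEFKSL
```

The first ten codons translate to MISELEFKSL, and the final four codons (GGG CTG GAG TGA) translate to GLE followed by a stop, matching the start and end of the protein sequence. gc_fraction can be run on the full gene sequence to compare with the GC content field; that comparison has not been carried out here.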