Gene Franean1_5722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5722 
Symbol 
ID5674048 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6945719 
End bp6947059 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content70% 
IMG OID641244575 
Producthypothetical protein 
Protein accessionYP_001509978 
Protein GI158317470 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAGG CGAGCTCGGC CCACCCGGAG AACCGCGAGC ACGTGTTCTA CTCCTGGGCG 
GCGCAGGAGA CGACGCACCC GCTCGTGGTC GCAGGGGCCG AGGGGAGCTG GTTCTGGGAC
AAGGGTGGCA CACGGTACCT CGACTTCTCC TCCCAGCTGG TGAACGCCAA CATCGGCCAC
CAGCATCCGG CGGTGGTCGA GGCGGTCGTG GCGCAGGCCC GGGAGCTGAC CACGGTCGGG
CCGCAGCACG CCCACGCCGT GCGCTCGGAG GCGGCGCGGC TGATCGCCGA GCGTGCTCCC
GGCGATCTGG ACCAGGTGTT CTTCACCACC GGGGGAGCGG AGGCCACGGA GAACGCCGTG
CGGCTGGCCC GGCTCGCCAC CGGGCGGAAC AAGATCCTCG CCGCCTACCG GTCGTACCAC
GGTTCGACCG GCGGCTCGAT CACGCTGACC GGGGAGCCGC GGCGTTGGGC GAGCGAACCG
GGTATTCCCG GTGTGGTGCA CTTCTTCGGC CCCTATCCGT ACCGGTCGTC TTTCCACGCC
GCCGACGCGG CTCAGGAGGG GGAGCGCGCG CTCGCGCACC TCGCAGAGGT GATCGAGCTG
GAGGGGCCGT CGGCGATCGC CGCGGTCATC CTCGAGCCGG TCGTCGGTAC GAACGGGATC
CTGGTGCCGC CGGACGGGTA CCTCGCGGGC GTCCGGGAGT TGTGCGACGC GCACGGAATA
CTGCTGATCG CCGACGAGGT GATGTCGGGT TTCGGGCGGT GCGGGGAATG GTTCGCGATC
GACCACTGGA ATGTGGTGCC CGATCTGATC TGTTTCGCGA AGGGCGTCAA CTCGGGTTAT
GTGCCGCTTG GCGGGGTGAT TATTTCGCAG CGGATCGCGG ACCTGTTCGC GCGGCGGCCG
TATCCTGGCG GGCTGACCTA TTCGGGGCAT CTGCTTGCCT GCGCGGCGGC GGTGGCGTCC
ATCAGGGCTT TCGAGTCCGA GGACATTCTC GGCCGGGCCC GTGCGCTCGG TTCCGAGGTT
ATCGGGCCGC AACTGGCTAA AATTGCCGCG CGGCATCCCA GCGTGGGTGA GGTGCGCGGG
CTCGGGGTGT TCTGGGCGGT CGAGCTCGTC CGTGACCGGG TTACCCGGGA GCCGTTGGTT
CCGTTCAACG CGGCGGGTGC CGCGGCCGCG CCGATGGCGG CCGTGACGGC GGCCTGCCGG
GAACGCGGCC TCTGGCCGTT CACCCATTTC AACCGGGTGC ATGTCGTGCC GCCGTGCACC
ACGAGTCCCG AGGATGTCTC CCTCGGCCTG TCGATCCTGG ACGAGGCCCT GGCCGTCGCG
GACGGCTGCT ACACCGGGTA A
 
Protein sequence
MTQASSAHPE NREHVFYSWA AQETTHPLVV AGAEGSWFWD KGGTRYLDFS SQLVNANIGH 
QHPAVVEAVV AQARELTTVG PQHAHAVRSE AARLIAERAP GDLDQVFFTT GGAEATENAV
RLARLATGRN KILAAYRSYH GSTGGSITLT GEPRRWASEP GIPGVVHFFG PYPYRSSFHA
ADAAQEGERA LAHLAEVIEL EGPSAIAAVI LEPVVGTNGI LVPPDGYLAG VRELCDAHGI
LLIADEVMSG FGRCGEWFAI DHWNVVPDLI CFAKGVNSGY VPLGGVIISQ RIADLFARRP
YPGGLTYSGH LLACAAAVAS IRAFESEDIL GRARALGSEV IGPQLAKIAA RHPSVGEVRG
LGVFWAVELV RDRVTREPLV PFNAAGAAAA PMAAVTAACR ERGLWPFTHF NRVHVVPPCT
TSPEDVSLGL SILDEALAVA DGCYTG