Gene Francci3_1862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1862 
Symbol 
ID3906137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2196012 
End bp2197109 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content67% 
IMG OID637879200 
Productaminotransferase 
Protein accessionYP_480967 
Protein GI86740567 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01141] histidinol-phosphate aminotransferase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAGTGA GATTCGCCGG CCGTGACCCG CTGCGGGACG TTACGGCCTA TCGGAGCACA 
GAGCCCGCGA ACACGGCGGC AAACGAGCCC GGGCGGTTCG ACCTGTCAAG CAACGAACTC
GTGCTTCCCC CACTGCCCCC ACTGCCCACA GTGCTCTCGG TCATCGAAGA TGGCCTCAGC
CGGCTGGCCC GTTACCCCGA CCCGACGGCA CGGACCATCA CAGAGGAGAT CGCGAAGCAC
CTGCACGTCT CCCCAGACGC GGTAGCGGTG GGCCCCGGAA GCGCCGGGGT ACTCCAACAG
ATTCTTCTCG CACTGTGCGG CGTCGGCGAC GAGGTCATCT ACGCGTGGCC GGGCTTCGAC
GCCTACCCGC TCCTGGTCGC CATCAGCGGG GCCACGAGCG TTCACGTTCC GCTCACGTCT
ACCGGTGATC ACGACCTGGA CGAGATACGT GCGCGGGTCG GCCCGCGCAC CAGGGTGATT
CTTCTCTGCT CTCCCCACAA TCCAACCGGC GTGGTCATTG ATCGCCATCG TCTGGACTCC
TTTCTGCGCT CACTGCCGGG CGACGTCCTC ACGGTGCTCG ACGAGGCCTA CGTCGAGTTC
GATCGCGGCG AGAATCCCCC GGGTACTCCG GATGTTCTCA GCCGGCACCG CAACATCGTC
GTGCTCCGGA CCTTTTCGAA GGCCTACGGG CTAGCGGGCC TGCGGATCGG CTACGCCGCG
GGCCCGGAGC GGATCATGGT GACCGTCCGC AAGGCCGGTC TTCCCTTTGG GGTCACCCAC
ATCGCGGAGC AGGCGGCGAT CCTCTCCCTG CACAGCGAGG GTGAGCTACG CGGTCGCCTG
GACACGGTGA CCGTGTCGCG AGACGAACTG ACCGCTGGTC TCCGGTGGTC GGGTCTGCCG
GTGCTGCCTT CTCGCGCCAA CTTCGTCTGG CTCCCCCTGG CCGCTGCGGC CGACTCCTTC
GCGCAGGACC TGGCGGAGGC AGGAATCAGG GTCCGCCCCT ATCCGGGGTA TGGCGTACGG
ATCTCCGTCG GAGCACCGGA AGCACAGGAG AGCCTCCTGA GGTCCCTTGG CCGAGGCGTA
CCCACGATCT GGGTGTGA
 
Protein sequence
MKVRFAGRDP LRDVTAYRST EPANTAANEP GRFDLSSNEL VLPPLPPLPT VLSVIEDGLS 
RLARYPDPTA RTITEEIAKH LHVSPDAVAV GPGSAGVLQQ ILLALCGVGD EVIYAWPGFD
AYPLLVAISG ATSVHVPLTS TGDHDLDEIR ARVGPRTRVI LLCSPHNPTG VVIDRHRLDS
FLRSLPGDVL TVLDEAYVEF DRGENPPGTP DVLSRHRNIV VLRTFSKAYG LAGLRIGYAA
GPERIMVTVR KAGLPFGVTH IAEQAAILSL HSEGELRGRL DTVTVSRDEL TAGLRWSGLP
VLPSRANFVW LPLAAAADSF AQDLAEAGIR VRPYPGYGVR ISVGAPEAQE SLLRSLGRGV
PTIWV