Gene Francci3_4054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4054 
Symbol 
ID3907015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4845643 
End bp4847265 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content71% 
IMG OID637881383 
Productaminotransferase 
Protein accessionYP_483133 
Protein GI86742733 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0160] 4-aminobutyrate aminotransferase and related aminotransferases 
TIGRFAM ID[TIGR00709] 2,4-diaminobutyrate 4-transaminases 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGTTT CCGCTCGTGA CGCCGCCGGC GGCTCAGCCG GCCGCACGGC GCCCCCCGCG 
ACCGTGCCTT CCGTAACCAC GAGAGCCACG ACGACGCCCC CTGCGACCAC GACCCCCCCG
ACCACGACCC CCCCGACCAC GACCCCCGTC GCCGCCAGGT CCACGCCTGC AGCCCCCACC
ACGGGTCCCA GGTCCGCACC CACGGACGCC GCGAACCCGG CCACGATCCT CGCGCGTCAG
CGTCAGCGCG AATCGGCGGC ACGGACCTAT GCGCGGACCC TCCCGATCGT GCCTGTCCGG
GCGTCCGGAA CCATGGTTCT CGGCGCGGAT GGCCGCCGCT ACCTGGACTG CCTGGCCGGC
GCCGGAACGC TCGCGCTGGG ACACAACCAT CCGGTGGTAA CCGAGGCGAT CACATCGATG
CTTGCCCGAG GCGCCCCGTT GCACACGCTC GACCTCGCCA CCCCCGAGAA GGACGCCTTC
ACCGACGAAC TCCGTGGCTG CCTGCCGGCC GGCATGGGGA GCGATGTGAA GTTGCACTTC
TGCGGCCCGA GCGGCGCCGA CGCGGTCGAG GCGGCCATCA AGCTCGCGCA GACGGTGACG
GGCAACCAGA CCATCCTCGC GTTCACCGGT GGATACCACG GAATGACGGC GGGTGCCCTG
GCCGTGACCG GGAACGTCGC CGTCAAAGAT CCTCTGCCGC CATCGACATC CGTCGTACGG
CTGCCGTTCC CCTTCCCCTA CCGGTGCCCG TTCGGGGTGG GGTCCGGTCG CGCCCCCGGT
CCGGACGGTG CCGAGTTGTC CGCCCGCTAC ATCGAACGTC TCCTCGATGA CCCGGCCGGA
GGGGTAGTCA GGCCGGCGGC ACTGATCGTA GAACCGGTTC AGGGTGAGGG CGGGGTGGTT
CCCGCTCCTG ACGGCTGGCT CCGCGCCATC CGCGAGATCA CCCGTCGCCG CGGCATCCCA
CTGATCCTGG ACGAGGTCCA GACGGGAGTC GGTCGCACCG GGAAGTACTG GGCCGCGCAG
CACAGCGGCA TCACCCCCGA CATCATCGTC ATGTCCAAGG CGATCGGCGG CGGGCTGCCA
CTGGCCGTCA TCGCCTACCG GTCTGAGCTG GACACCTGGG CGCCCGGGGC ACATACCGGC
ACCTTCCGTG GAAACCAGCT GGCGATGGCC GCGGGCCATG CCACGCTGCG CCTGGTGCGA
GAGGACCGTC TCGACGAACG CGCCGCCCGA CTCGGCACGC GCCTACTCAC CGGCCTGGCA
GAGATCGCCC GGAACCGGCC GCAGGTGGGC GACGTACGCG GCCGCGGGCT GATGCTCGGC
GCCGAGATGG TCGATCCGAC CGCAGCCCCG GACCTCGTCG GCGCGCATCC CGCCGACGCG
AGACTCGCCG GTCTGGTACG TGCCGAATGT CTGCGCCGCG GCCTGATCAT CGAGCTCGGT
GGGCGGCACG GCGCGGTCGT CCGGCTGTTG CCACCGCTGA TCCTCAGCGA TGAGGAGGGC
GATCGCGTCC TCGCGATCCT CGCCGACGCC ATCGATGCCG CCGTCCGCAG GCTCTCCGTC
CACGCGGCTT TGGGACACCC ACCCGCCGAT CAGGCCCACG TCCGGTTCGG ATCCGCCGGA
TGA
 
Protein sequence
MTVSARDAAG GSAGRTAPPA TVPSVTTRAT TTPPATTTPP TTTPPTTTPV AARSTPAAPT 
TGPRSAPTDA ANPATILARQ RQRESAARTY ARTLPIVPVR ASGTMVLGAD GRRYLDCLAG
AGTLALGHNH PVVTEAITSM LARGAPLHTL DLATPEKDAF TDELRGCLPA GMGSDVKLHF
CGPSGADAVE AAIKLAQTVT GNQTILAFTG GYHGMTAGAL AVTGNVAVKD PLPPSTSVVR
LPFPFPYRCP FGVGSGRAPG PDGAELSARY IERLLDDPAG GVVRPAALIV EPVQGEGGVV
PAPDGWLRAI REITRRRGIP LILDEVQTGV GRTGKYWAAQ HSGITPDIIV MSKAIGGGLP
LAVIAYRSEL DTWAPGAHTG TFRGNQLAMA AGHATLRLVR EDRLDERAAR LGTRLLTGLA
EIARNRPQVG DVRGRGLMLG AEMVDPTAAP DLVGAHPADA RLAGLVRAEC LRRGLIIELG
GRHGAVVRLL PPLILSDEEG DRVLAILADA IDAAVRRLSV HAALGHPPAD QAHVRFGSAG